I am an experienced Application Support Engineer and Systems Reliability Engineer with more than 15 years of experience supporting enterprise systems, telecom platforms, and cloud-based applications. I specialize in production monitoring, incident management, and system reliability, ensuring that business-critical systems remain stable and available.
I have strong experience in log analysis and observability using Splunk, where I analyze application logs, investigate alerts, and assist in troubleshooting performance issues. I also help improve monitoring processes by supporting dashboard development, alert optimization, and documentation of monitoring workflows.
My background includes extensive work with Linux/Unix servers, scripting, and cloud platforms (AWS and GCP). I have supported production environments for large organizations where uptime, performance, and quick incident resolution are critical.