Lead Site Reliability Engineer with 13+ years of experience in platform engineering, observability, and automation. Expertise in SLI/SLO frameworks, OpenTelemetry, MELT (metrics, events, logs, traces) strategy, and large-scale distributed systems. Proven record of driving SRE best practices, enhancing cloud-native monitoring (Datadog, New Relic, Prometheus), and improving system resilience. Adept at leading incident management, mentoring engineers, and transitioning from reactive to proactive reliability strategies.