Lead Observability Engineer – Sumo Logic jobs in United States
cer-icon
Apply on Employer Site
company-logo

E-Solutions · 6 hours ago

Lead Observability Engineer – Sumo Logic

E-Solutions is seeking a highly skilled Lead Observability Engineer to lead a critical implementation of Sumo Logic for a client migrating from Dynatrace. This role requires deep expertise in Sumo Logic, Site Reliability Engineering (SRE) practices, and Kubernetes observability, focusing on designing and implementing scalable solutions for monitoring and reliability.

ConsultingHuman ResourcesRecruitingStaffing Agency
check
H1B Sponsor Likelynote
Hiring Manager
Pramod Singh
linkedin

Responsibilities

Lead the end-to-end implementation of Sumo Logic observability platform for AWS and EKS environments
Migrate monitoring and alerting assets from Dynatrace to Sumo Logic
Define and implement SLIs/SLOs, error budgets, and reliability metrics for containerized services
Deploy and configure Sumo Logic collectors across AWS and Kubernetes workloads (EKS)
Configure log, metric, and trace ingestion pipelines using OpenTelemetry and Sumo Logic apps
Design and maintain dashboards for service health, performance, and reliability insights
Implement intelligent alerting and notification workflows, using thresholds, baselines, and anomaly detection
Collaborate with DevOps, SRE, and development teams to ensure complete tracing coverage across services
Ensure best practices for alert noise reduction, escalation policies, and incident response are in place
Contribute to observability runbooks, operational handover, and training for the client SRE team

Qualification

Sumo LogicSite Reliability EngineeringKubernetes (EKS)OpenTelemetryAWSAdvanced troubleshootingProactive recommendations

Required

Deep expertise in Sumo Logic
Site Reliability Engineering (SRE) practices
Kubernetes (EKS) observability
Lead the end-to-end implementation of Sumo Logic observability platform for AWS and EKS environments
Migrate monitoring and alerting assets from Dynatrace to Sumo Logic
Define and implement SLIs/SLOs, error budgets, and reliability metrics for containerized services
Deploy and configure Sumo Logic collectors across AWS and Kubernetes workloads (EKS)
Configure log, metric, and trace ingestion pipelines using OpenTelemetry and Sumo Logic apps
Design and maintain dashboards for service health, performance, and reliability insights
Implement intelligent alerting and notification workflows, using thresholds, baselines, and anomaly detection
Collaborate with DevOps, SRE, and development teams to ensure complete tracing coverage across services
Ensure best practices for alert noise reduction, escalation policies, and incident response are in place
Contribute to observability runbooks, operational handover, and training for the client SRE team
Strong knowledge of the new UI navigation
Proven expertise in building and optimizing queries
Advanced troubleshooting skills
The ability to go beyond task execution and provide proactive recommendations to improve our setup and overall efficiency

Company

E-Solutions

company-logo
E-Solutions is a talent acquisition company that offers staffing, application development, and strategic outsourcing services.

H1B Sponsorship

E-Solutions has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (6)
2024 (11)
2023 (12)
2022 (11)
2021 (12)
2020 (16)

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
ANKIT MISHRA
Head – Google Cloud Platform (GCP) - Global Delivery Practice | Client Partnering | Cloud Computing
linkedin
Company data provided by crunchbase