Lead Site Reliability Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

McGraw Hill · 7 hours ago

Lead Site Reliability Engineer

McGraw Hill is a leading provider of digital educational resources and content, and they are seeking a Lead Site Reliability Engineer to lead a team of 6 Engineers for their Digital Platform Group. This role involves ensuring the reliability, scalability, and performance of K–12 learning platforms by collaborating with engineering and product teams and leveraging expertise in cloud infrastructure.

E-LearningEdTechEducationPublishing
check
H1B Sponsor Likelynote

Responsibilities

Lead a 6 member SRE team supporting production infrastructure and services
Manage backlog, sprint planning, and team velocity
Own reliability, uptime, security, cost, and performance of services
Define and monitor SLOs for application workloads
Plan on-call rotations and work to reduce alert fatigue
Forecast seasonal growth and capacity planning
Mentor engineers and foster professional growth
Report status and issues to leadership monthly
Partner with development teams
Collaborate with CyberSecurity on risk mitigation
Collaborate with FinOps on cost reduction
Design and troubleshoot highly-distributed, cloud-based production systems
Maintain infrastructure-as-code and monitoring-as-code practices
Improve system resiliency through failure injection and chaos testing
Participate in on-call rotation and resolve operational issues
Optimize existing systems for performance and cost
Ensure telemetry provides visibility to application performance
Support agile development practices and code reviews

Qualification

AWSTerraformSite Reliability EngineeringCI/CD pipelinesObservability toolsAgile developmentProblem-solvingTeam leadership

Required

5+ years of experience in SRE, DevOps, or Software Engineering roles supporting enterprise applications
Strong problem-solving, triage, and root cause analysis skills with a systems engineering mindset
Deep expertise in the AWS ecosystem, with hands-on experience across core services including primarily ECS, RDS, EKS, IAM, CloudWatch, and networking configurations
Expertise with Terraform for managing and automating scalable cloud infrastructure
Skilled in CI/CD pipelines (e.g., GitHub Actions) and managing end-to-end software delivery lifecycles
Strong familiarity with telemetry and observability tools (e.g., New Relic, Datadog), including querying logs and metrics for performance monitoring

Benefits

A full range of medical and/or other benefits may be provided, depending on the position offered.

Company

McGraw Hill

company-logo
We are a leading global education company that partners with millions of educators, learners and professionals around the world.

H1B Sponsorship

McGraw Hill has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (27)
2024 (13)
2023 (23)
2022 (37)
2021 (27)
2020 (28)

Funding

Current Stage
Public Company
Total Funding
unknown
2025-07-24IPO
2021-09-30Private Equity
2021-06-15Acquired

Leadership Team

leader-logo
Lloyd G. Waterhouse
CEO & President
leader-logo
Simon Allen
President & CEO
linkedin
Company data provided by crunchbase