Elsevier · 3 hours ago
Manager Site Reliability Engineering
Elsevier is a global provider of information-based analytics and decision tools for professional and business customers, and they are seeking a Manager of Site Reliability Engineering. This role involves leading multiple SRE teams to ensure alignment with SRE frameworks, promoting best practices, and driving cloud reliability and performance initiatives.
ContentContent DiscoveryDeliveryHealth CareInformation ServicesInformation TechnologyPublishing
Responsibilities
Managing high performance SRE teams ideally in multiple counties. We are not looking for an individual contributor
Promoting and implementing Site Reliability Engineering best practices and principles across product and platform teams
Architecting, implementing, and managing infrastructure using Infrastructure as Code (IaC) and DevOps principles
Designing and maintaining secure-by-default cloud-native systems with a focus on continuous improvement of security posture
Defining and enforcing SLA/SLI/SLO standards for production systems
Developing and maintaining automated frameworks for provisioning, deployment, scaling, and monitoring
Conducting in-depth troubleshooting of complex production issues across application, infrastructure, and network layers
Leading proof-of-concept efforts to evaluate and introduce new technologies
Implement policy and compliance checks within CI/CD pipelines
Qualification
Required
Current and extensive experience managing teams of SRE's. We are not looking to hire an individual contributor in this role
Proficiency with at least one major public cloud provider: Azure, AWS
Extensive experience with Terraform, Ansible, and other IaC/orchestration tools
Expertise in Kubernetes (AKS/EKS/GKE), containerized workloads, and deployment strategies (e.g., Blue Green)
Deep knowledge of Linux and Windows server environments
Proven experience in building and enforcing automation frameworks for CI/CD and infrastructure provisioning
Hands-on experience with observability platforms such as Grafana, Kibana, Splunk, ELK Stack (Elasticsearch, Logstash, Kibana), OpenTelemetry, Prometheus, Loki
Strong knowledge of SLAs, SLIs, and SLOs and their application in production environments
Experience with monitoring, alerting, and logging best practices
Solid understanding of cloud-native security, identity management, and secrets management (e.g., HashiCorp Vault)
Skilled in scripting and programming (e.g., Python, Bash, Golang, PowerShell, C#)
Strong knowledge of networking, application performance tuning, and troubleshooting
Familiarity with common CI/CD and version control tools (e.g., Git, GitLab, GitHub, Jenkins)
Benefits
Country specific benefits
Company
Elsevier
Elsevier is a world-leading provider of information solutions that enhance the performance of science, health, and technology. It is a sub-organization of RELX.
H1B Sponsorship
Elsevier has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (32)
2024 (17)
2023 (28)
2022 (46)
2021 (28)
2020 (19)
Funding
Current Stage
Late StageTotal Funding
unknown2003-09-01Private Equity
Recent News
News-Medical.Net
2026-01-16
2025-12-18
Business Wire
2025-12-17
Company data provided by crunchbase