SIGN IN
Senior Lead Site Reliability Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Zoom · 10 hours ago

Senior Lead Site Reliability Engineer

Zoom is a collaboration platform company dedicated to helping people communicate better. The Senior Lead Site Reliability Engineer will be responsible for managing hybrid systems, developing automation, and ensuring optimal performance across global data centers.
CollaborationInformation TechnologyMessagingSaaSVideo Conferencing
badNo H1Bnote

Responsibilities

Providing technical direction for cross-team initiatives and major incidents
Mentor SRE's and developers; define best practices and design patterns
Partner with Security, Networking, and Platform teams on architecture roadmaps
Influence vendor and hardware strategy for on-prem and cloud workloads
Design self-healing platforms using automation, chaos engineering, and fault-tolerant patterns
Optimize Linux systems at scale: performance tuning, kernel parameters, networking, storage, and security hardening
Define best practices and advocate for them across the company
Excellent communication skills and experience driving cross team projects as a technical lead
Able to participate in on-call shifts and incident management and work after hours/weekends for application releases/deployments

Qualification

Linux system administrationConfiguration managementIncident responseNetworking expertiseCoding abilityChaos engineeringDistributed storage systemsSecurity-first mindsetSoft skills

Required

10+ years in SRE, production engineering, or large-scale systems administration
Have experience of Linux system administration (systemd, cgroups, networking, filesystems, performance analysis)
Demonstrate coding ability with at least one programming language e.g. Python
Have experience with configuration management (Ansible), IaC (Terraform, Packer), CI/CD pipelines (Jenkins, GitLab), container orchestration (k8s, Docker) and observability platforms
Have experience with incident response for mission-critical environments
Possess a security -first mindset (TPM, secure boot, identity, secrets management)
Demonstrate networking expertise: BGP, load balancing, DNS, TLS, traffic engineering
Have experience with chaos engineering and resilience testing
Have experience with distributed storage systems such as Ceph

Benefits

Variety of perks, benefits, and options to help employees maintain their physical, mental, emotional, and financial health; support work-life balance; and contribute to their community in meaningful ways.

Company

Zoom

twittertwittertwitter
company-logo
Zoom is a software company that offers a communications platform that connects people through video, voice, chat, and content sharing.

Funding

Current Stage
Public Company
Total Funding
$276M
Key Investors
ARK Investment ManagementSequoia CapitalEmergence Capital
2021-11-04Post Ipo Equity· $130M
2019-04-19Post Ipo Equity
2019-04-18IPO

Leadership Team

leader-logo
Eric Yuan
Founder & CEO
linkedin
leader-logo
Xuedong Huang
Chief Technology Officer
linkedin
Company data provided by crunchbase