Lead Site Reliability Engineer (SRE) jobs in United States
cer-icon
Apply on Employer Site
company-logo

AT&T · 15 hours ago

Lead Site Reliability Engineer (SRE)

AT&T is a leading communications and technology company that connects the world. They are seeking a Lead Site Reliability Engineer (SRE) to design and implement robust, scalable, and reliable technical solutions that ensure high availability and performance in cloud and data environments.

CollaborationCommunications InfrastructureMobileService IndustryTelecommunicationsWireless
check
H1B Sponsor Likelynote
Hiring Manager
Eric Hoagberg, Lead Talent Acquisition Manager, MBA
linkedin

Responsibilities

The EngOps Tier 2/SRE team ensures applications and systems are highly reliable, scalable, and performant while fostering a collaborative culture between development and operations
Work with T1 team on incident as Triage lead during outages or critical issues Pager duty issues
Minimize downtime and user impact during incidents
Conduct detailed After Action Reviews involving all stakeholders and chalk out short term and long-term resiliency options
Eliminate recurrence of similar issues through systemic fixes
Define and implement monitoring and alerting strategies tailored to the launch
Collaborate with Product development teams to gain deep insight into the application architecture, flows and critical dependencies
Monitor and evaluate key performance metrics like latency, throughput, and error rates and update alerts
Propose architectural or operational changes to prevent reoccurrence
Reduce Mean Time to Resolution (MTTR) for incidents

Qualification

Site Reliability EngineeringCloud platformsPythonJavaAI/ML conceptsPerlJavaScriptNetwork standardsCollaborationProblem-solving

Required

Bachelor's degree in computer science, Information Systems, or a related discipline
Over 10 years hands-on experience in architecting and building scalable platforms and applications in cloud/data environments
8+ Years of experience in Python, Perl, Java, Javascript, and Site Reliability Engineering

Preferred

Practical understanding of AI/ML concepts and their integration in enterprise platforms

Benefits

Medical/Dental/Vision coverage
401(k) plan
Tuition reimbursement program
Paid Time Off and Holidays (based on date of hire, at least 23 days of vacation each year and 9 company-designated holidays)
Paid Parental Leave
Paid Caregiver Leave
Additional sick leave beyond what state and local law require may be available but is unprotected
Adoption Reimbursement
Disability Benefits (short term and long term)
Life and Accidental Death Insurance
Supplemental benefit programs: critical illness/accident hospital indemnity/group legal
Employee Assistance Programs (EAP)
Extensive employee wellness programs
Employee discounts up to 50% off on eligible AT&T mobility plans and accessories
AT&T internet (and fiber where available) and AT&T phone.

Company

AT&T is a telecommunications company that provides wireless communications, internet and digital television services.

H1B Sponsorship

AT&T has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (575)
2024 (586)
2023 (282)
2022 (450)
2021 (271)
2020 (162)

Funding

Current Stage
Public Company
Total Funding
$5.04B
Key Investors
National Telecommunications and Information Administration
2025-09-19Post Ipo Debt· $5B
2024-02-12Grant· $42.3M
2023-01-19Grant· $2.2M

Leadership Team

leader-logo
Jeremy Legg
Chief Technology Officer
linkedin
leader-logo
Pascal Desroches
Senior Executive Vice President and Chief Financial Officer
linkedin

Recent News

PCMag.com - Technology Product Reviews, News, Prices & Tips
Company data provided by crunchbase