Manager Site Reliability Engineering jobs in United States
cer-icon
Apply on Employer Site
company-logo

Sphere Entertainment Co. · 3 days ago

Manager Site Reliability Engineering

Sphere Entertainment Co. is a premier live entertainment and media company that focuses on redefining the future of entertainment. They are seeking a Manager of Site Reliability Engineering to lead platform stability, scalability, and security efforts for their digital sports streaming application, ensuring the reliability of AWS-based infrastructure and guiding architectural decisions.

ConcertsEventsMedia and Entertainment

Responsibilities

Own the reliability, performance, and security of the platform infrastructure that supports our live and on-demand video streaming app
Lead and grow a small technical team (SRE, VideoOps) and act as a hands-on mentor and contributor
Design and maintain robust monitoring, logging, and alerting systems, using tools such as CloudWatch, Datadog, and Conviva, to ensure visibility into platform health, fast incident response, and high availability across our video streaming infrastructure
Define and enforce operational best practices including disaster recovery, redundancy, backup, and failover strategies
Investigate and resolve complex issues across the application stack, from infrastructure and APIs to video delivery and playback
Lead incident response efforts and participate in an on-call rotation during peak traffic events (typically evenings EST)
Collaborate with Product and Engineering teams to guide architectural decisions that prioritize platform resilience, scalability, and security
Partner with L1 Operations and Customer Care teams to triage issues, drive incident resolution, and close the loop on recurring or systemic problems
Own the implementation and continuous strengthening of platform security, including identity management, secrets handling, IAM policies, and AWS-level hardening
Evaluate and introduce new tools, technologies, and architectural patterns to improve the reliability of the system
Track and improve SLAs, SLOs, and operational KPIs related to uptime, latency, video playback quality, and security posture

Qualification

AWS infrastructureVideo streaming architectureSystem observabilityPlatform securityScriptingAutomationCI/CD pipelinesAnalytical skillsCommunicationTeam leadership

Required

5+ years of experience in SRE, DevOps, or platform infrastructure roles, with 2+ years in a team lead or manager capacity
Experience operating and scaling production environments in AWS, including services like CloudFront, Lambda, S3, API Gateway, and CloudWatch
AWS Certification (Solutions Architect, DevOps Engineer, or similar) or equivalent deep hands-on experience
Strong background in system observability, with experience using tools like Conviva, CloudWatch, and Datadog for monitoring, distributed tracing, and alerting
Deep understanding of video streaming architecture including HLS/DASH, CDNs, DRM, SSAI, and multi-platform delivery (mobile, web, CTV)
Expertise in scripting and automation using Python, Bash, or similar, with infrastructure-as-code tools like Terraform or CloudFormation
Proven ability to lead platform security initiatives, including IAM policy management, token handling, and securing service architecture
Experience collaborating with engineering teams to improve CI/CD pipelines, automate infrastructure changes, and support safe production releases
Strong analytical and troubleshooting skills across application, network, and video delivery layers
Excellent communication skills with the ability to drive cross-functional alignment and manage vendor relationships
Participation in an after-hours on-call rotation is expected, particularly during live sporting events and high-traffic periods

Benefits

Robust set of tools and resources to help employees understand their interests and purpose
Upskilling employees’ talents and strengths
Growth and longevity for our employees are top priorities here.

Company

Sphere Entertainment Co.

twittertwitter
company-logo
Sphere Entertainment Co. is a premier live entertainment and media company.

Funding

Current Stage
Public Company
Total Funding
$225M
Key Investors
Point72
2024-06-25Post Ipo Equity
2023-12-05Post Ipo Debt· $225M
2020-04-09IPO

Leadership Team

leader-logo
Felicia Yue
Executive Vice President, Chief Technology Officer
linkedin
leader-logo
Robert Langer
Chief Financial Officer and Treasurer
linkedin
Company data provided by crunchbase