Underdog · 8 hours ago
Senior Site Reliability Engineer - Infrastructure
Underdog is a rapidly growing sports company focused on creating fun and engaging products for sports fans. They are seeking a Senior Site Reliability Engineer to help define and improve their reliability, scalability, and operational excellence as the company grows, with responsibilities including incident response management and capacity planning.
eSportsFantasy SportsGaming
Responsibilities
Own and maintain the incident response process, including defining procedures, tools, and best practices
Guide teams in establishing and monitoring Service Level Objectives (SLOs), including setting up alerts and reporting systems
Lead capacity planning initiatives, focusing on both short and long-term scalability while optimizing costs
Develop and implement disaster recovery plans, including regular testing and regulatory compliance
Collaborate with teams on architecture decisions to ensure high availability and scalability
Manage launch and event planning for high-traffic occasions, focusing on infrastructure preparation and capacity management (a.k.a. Launch Readiness)
Act as an internal expert and consultant for monitoring tools like Datadog and Pagerduty and infrastructure like AWS and Kubernetes
Emphasis on automation and tooling to scale our workload
Contribute across codebases in Ruby, Python, Go, TypeScript, Swift, and Kotlin as needed to support the initiatives described above
Qualification
Required
Own and maintain the incident response process, including defining procedures, tools, and best practices
Guide teams in establishing and monitoring Service Level Objectives (SLOs), including setting up alerts and reporting systems
Lead capacity planning initiatives, focusing on both short and long-term scalability while optimizing costs
Develop and implement disaster recovery plans, including regular testing and regulatory compliance
Collaborate with teams on architecture decisions to ensure high availability and scalability
Manage launch and event planning for high-traffic occasions, focusing on infrastructure preparation and capacity management (a.k.a. Launch Readiness)
Act as an internal expert and consultant for monitoring tools like Datadog and Pagerduty and infrastructure like AWS and Kubernetes
Emphasis on automation and tooling to scale our workload
Contribute across codebases in Ruby, Python, Go, TypeScript, Swift, and Kotlin as needed to support the initiatives described above
A strong written and verbal communicator
Collaborative by nature
Someone who enjoys using research, data, and experiments to make decisions; you believe 'Hope is not a strategy.'
You enjoy working directly with customers (generally engineers or other people inside the company)
You think long-term about what is best for the business and its customers
You are excited to take ownership
You are very comfortable around an IDE, working with multiple languages, multiple web application frameworks, AWS services, Kubernetes, PostgreSQL
You can work independently to learn new languages/technologies as needed
You enjoy deploying changes to production quickly, multiple times a week if necessary
Preferred
Experience with PostgreSQL SQL query optimization, tweaking autovacuum settings, table statistics, different index types, etc
Experience with Redis / Valkey Optimization
Experience with Datadog or similar observability tools
Experience working as a web application developer, frontend or backend, especially in React and Ruby on Rails
Experience with AWS cost optimization
Read the Google SRE books or similar books, or have other forms of SRE training
Actively leveraging the capabilities of AI to augment abilities and gain knowledge about interested domains
Benefits
Unlimited PTO (we're extremely flexible with the exception of the first few weeks before & into the NFL season)
16 weeks of fully paid parental leave
Home office stipend
A connected virtual first culture with a highly engaged distributed workforce
5% 401k match
FSA
Company paid health, dental, vision plan options for employees and dependents
Company
Underdog
We’re the fastest-growing sports gaming company ever. Our mission is to build innovative games and products for American sports fans.
H1B Sponsorship
Underdog has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2024 (1)
2023 (1)
Funding
Current Stage
Growth StageTotal Funding
$115MKey Investors
Spark CapitalBlackRockKevin Carter
2025-03-26Series C· $70M
2022-07-26Series B· $35M
2021-05-03Series A· $10M
Recent News
Mergers & Acquisitions
2025-10-19
2025-10-18
Company data provided by crunchbase