Senior Site Reliability Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Overstory · 3 weeks ago

Senior Site Reliability Engineer

Overstory is addressing the climate crisis by leveraging advanced technology to enhance the resilience of the electrical grid. The Senior Site Reliability Engineer will focus on managing GCP infrastructure and improving DevOps practices to ensure operational excellence and reliability across engineering teams.

AnalyticsArtificial Intelligence (AI)Big DataMachine LearningSoftware

Responsibilities

Design and evolve Overstory’s cloud infrastructure to support the company’s scaling needs, laying the foundation for performance, security, and maintenance
Build tooling and automation that promote team autonomy while ensuring operational excellence
Advance our observability platform to support long-term insights, meaningful alerting and improved ease of use for the engineering teams
Build visibility into infra costs to raise awareness across engineering and empower teams to make cost-aware decisions
Champion reliability best practices by shaping incident processes, defining SLOs, and fostering a culture of ownership and continuous improvement

Qualification

GCP infrastructureInfrastructure-As-CodeObservabilityKubernetesCloud ProvidersUnix-based environmentProactive attitudeCommunicationTeamworkSelf-starter mindset

Required

You are able to prioritize collaboratively between tactical problems and strategic direction
You are comfortable and effective working in a terminal in a Unix-based environment
You are confident in driving Infrastructure-As-Code principles
You have experience working with any of the major Cloud Providers
You have strong communication skills and are comfortable expressing your ideas to multiple different audiences
You are proactive with a positive attitude, well organised, and adept at managing competing deadlines and priorities
You are comfortable with and excited by a fast-paced and often changing environment, eager to solve new problems and learn new skills in order to succeed
You have a self-starter mindset; you proactively identify issues and opportunities and tackle them without being told to do so
Teamwork is at your core, and you like to help others grow and succeed

Preferred

You can demonstrate experience scaling large distributed architectures
You have experience in working in a remote-first environment
You have worked with satellite data and/or imagery
You have prior experience with Kubernetes

Benefits

Flexible working environment with a lot of autonomy.
Remote working budget
Educational budget
Time to develop new skills
Equity
Competitive salary

Company

Overstory

twittertwitter
company-logo
Overstory is AI-powered grid resilience software that helps electric utilities prevent wildfires and power outages.

Funding

Current Stage
Growth Stage
Total Funding
$68.07M
Key Investors
Blume EquityB CapitalConvective Capital
2025-11-25Series B· $43M
2023-10-19Series A· $14M
2022-11-10Seed· $5.2M

Leadership Team

leader-logo
Indra den Bakker
CEO & Co-founder
linkedin
Company data provided by crunchbase