Software Engineer, Site Reliability (SRE) jobs in United States
cer-icon
Apply on Employer Site
company-logo

Sierra · 2 months ago

Software Engineer, Site Reliability (SRE)

Sierra is a company focused on improving customer experiences through AI. As a Software Engineer on the Site Reliability team, you will define and build the foundation of reliability, observability, and scalability across Sierra’s infrastructure, collaborating closely with engineering and product teams.

Artificial Intelligence (AI)Enterprise SoftwareSaaS
check
H1B Sponsor Likelynote

Responsibilities

Own Sierra’s observability stack—monitoring, alerting, logging, and tracing—to give engineers clear visibility into system health and performance
Partner with product and platform engineers to design systems that are reliable and scalable from day one—not as an afterthought
Design and implement scalable, reliable, and secure cloud infrastructure (AWS) using Terraform and modern DevOps tooling
Improve the reliability and scalability of our LLM deployments, ensuring robust, performant, and cost-effective operation
Lead improvements to deployment pipelines, CI/CD tooling, and incident management processes to reduce downtime and response time
Define the foundation of SRE practices at Sierra, influencing culture, tooling, and best practices across the engineering org

Qualification

Site Reliability EngineeringAWSTerraformObservability systemsContainer orchestrationCloud networkingCI/CD toolingCollaborationProblem-solvingCommunication

Required

5+ years of hands-on experience in Site Reliability or Infrastructure engineering roles for complex SaaS or cloud-based systems
Experience designing for availability, scalability, and reliability at both infrastructure and application layers
Deep experience with Terraform, AWS services, container orchestration, and cloud networking (including IAM and VPC architecture)
Strong background in observability systems (e.g., Prometheus, Grafana, Datadog, or similar)
Experience working with enterprise customers and familiarity with their compliance and networking needs along with integration patterns
Comfortable working in fast-moving environments and collaborating across product, ML, and core engineering teams
Degree in Computer Science or a related field, or equivalent professional experience

Preferred

Experience with LLM infrastructure — optimizing inference performance, managing fine-tuned models, or large-scale model deployment
Past experience in an early-stage startup environment, especially defining SRE culture and tooling from scratch
Familiarity with incident management automation or self-healing infrastructure patterns

Benefits

Flexible (Unlimited) Paid Time Off
Medical, Dental, and Vision benefits for you and your family
Life Insurance and Disability Benefits
Retirement Plan (e.g., 401K, pension) with Sierra match
Parental Leave
Fertility and family building benefits through Carrot
Lunch, as well as delicious snacks and coffee to keep you energized
Discretionary Benefit Stipend giving people the ability to spend where it matters most
Free alphorn lessons

Company

Sierra

twittertwittertwitter
company-logo
Sierra provides a platform that builds and manages conversational AI agents for customer experiences.

H1B Sponsorship

Sierra has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (11)
2024 (2)

Funding

Current Stage
Growth Stage
Total Funding
$635M
Key Investors
SoftBank Vision FundGreenoaks
2025-12-04Series Unknown
2025-09-04Series Unknown· $350M
2024-10-28Series Unknown· $175M

Leadership Team

leader-logo
Bret Taylor
Co-Founder
linkedin
Company data provided by crunchbase