Senior Site Reliability Engineer jobs in United States

Gradle Inc · 2 hours ago

Senior Site Reliability Engineer

Gradle Technologies is a company that provides a toolchain observability and acceleration platform called Develocity. They are seeking a Senior Site Reliability Engineer to ensure the reliability, performance, and availability of Develocity instances while collaborating with engineering teams to enhance software delivery and operational excellence.
Analytics, Developer Tools, Enterprise Software, Information Technology, Open Source, SaaS, Software
Culture & Values
H1B Sponsor Likely

Responsibilities

Operate and maintain all Develocity instances and supporting services
Participate in a follow-the-sun on-call rotation, owning incident response and troubleshooting issues across the stack
Drive automation across application deployment, upgrades, monitoring, self-healing, and recovery
Build and maintain observability for all managed services (logging, metrics, tracing, and alerting)
Work with engineering teams to build reliability into features from the start
Run incident response and retrospectives, and make sure we learn from them
Own disaster recovery, backups, and business continuity
Communicate with customers during incidents and maintenance windows
Optimize performance, resource usage, and costs
Help evolve our SaaS operations as we grow

Qualifications

Kubernetes, AWS, Observability tools, Incident management, Scripting Python, Scripting Bash, Infrastructure as Code, SRE best practices, Disaster recovery, Customer-facing skills, Communication

Required

5+ years in SRE, DevOps, or equivalent role operating production services at scale
Strong Kubernetes experience in production environments
Cloud infrastructure expertise, preferably AWS (EKS, RDS, S3, EC2)
Proficiency with observability tools (Prometheus, Grafana) and Infrastructure as Code (Terraform)
Track record of incident management and response
Knowledge of SRE best practices (SLAs, SLOs)
Scripting proficiency (Python, Bash) for automation
Experience with 24/7 on-call rotations
Strong written and verbal English communication

Preferred

Experience operating SaaS platforms at scale
Familiarity with Develocity
JVM language experience (Java, Kotlin)
Disaster recovery planning and execution experience
Customer-facing incident communication skills
Experience establishing SRE practices in new or growing teams

Benefits

A ground-floor role in a new SRE team—you'll shape how we do things, not inherit someone else's decisions.
Real ownership of production systems used by engineers at companies you've heard of.
Direct interaction with customers when things go wrong (and when they go right).
A culture that values automation over heroics.
In-person meetings, such as our annual company offsite and team meetings.
Work from home in a remote-first environment.
Competitive salaries and equity grants.

Company

Gradle Inc

Gradle Technologies is the award-winning developer productivity company behind Gradle Build Tool—one of the most used build systems in the world—and Develocity®, the leading developer observability platform.

H1B Sponsorship

Gradle Inc has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2025 (1)
2024 (1)
2022 (2)

Funding

Current Stage
Growth Stage
Total Funding
$54.7M
Key Investors
Triangle Peak Partners, Harmony Partners, DCVC
2021-11-18 · Series C · $27M
2018-10-02 · Series B · $13M
2016-05-25 · Series A · $10.5M

Leadership Team

Hans Dockter
CEO
Rolf Dockter
CFO / COO
Company data provided by crunchbase