Senior Site Reliability Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Gridware · 8 hours ago

Senior Site Reliability Engineer

Gridware is a San Francisco-based technology company dedicated to protecting and enhancing the electrical grid. They are seeking a Senior Site Reliability Engineer to design, build, and maintain the infrastructure for their cloud-native applications, focusing on security, scalability, and reliability.

EnergyInternet of ThingsPower GridSoftware
check
H1B Sponsor Likelynote
Hiring Manager
Brian Doerr
linkedin

Responsibilities

Design, build, and maintain scalable, secure, and highly available infrastructure on AWS (EKS, EC2, RDS,MSK, S3, VPC …)
Manage and optimize Kubernetes clusters (EKS) and deploy applications using ArgoCD with GitOps best practices
Implement and maintain CI/CD pipelines using GitHub Actions (GHA), ensuring fast, reliable, and automated software delivery
Build and support Kafka-based event streaming platforms using Amazon MSK for high-throughput, low-latency data pipelines
Manage identity and access across platforms with IdP integration (Okta, Auth0, or similar)
Define and manage Infrastructure as Code with Terraform
Monitor, troubleshoot, and optimize system performance, cost, and reliability using observability tools like Grafana and Loki

Qualification

AWS infrastructure managementKubernetes administrationInfrastructure as CodeCI/CD automationMonitoringLoggingEvent streaming (Kafka)NetworkingSecurityIdP integrationCollaborationProblem-solving

Required

5+ years in DevOps/SRE/Platform Engineering, with production experience in AWS infrastructure management
Deep knowledge of Kubernetes administration and GitOps tools like ArgoCD
Proficiency with Infrastructure as Code with Terraform
Hands-on experience with CI/CD automation and pipelines (preferably GitHub Actions)
Expertise in running and maintaining distributed systems such as Kafka on MSK and relational databases (RDS)
Strong understanding of networking, security best practices, and IdP-driven access control
Experience with monitoring and logging solutions (Grafana, Loki, Prometheus, or similar)
Ability to debug complex production issues across infrastructure, deployment, and networking layers

Preferred

Familiarity with Databricks or ML Ops pipelines for data and model deployment
Experience with Terragrunt
Knowledge of multi-cloud or hybrid cloud environments and container security tools

Benefits

Health, Dental & Vision (Gold and Platinum with some providers plans fully covered)
Paid parental leave
Alternating day off (every other Monday)
“Off the Grid”, a two week per year paid break for all employees.
Commuter allowance
Company-paid training

Company

Gridware

twittertwittertwitter
company-logo
Gridware is a grid‑technology company dedicated to improving safety and reliability on the electrical transmission and distribution systems.

H1B Sponsorship

Gridware has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (7)
2024 (1)
2022 (2)
2021 (1)

Funding

Current Stage
Growth Stage
Total Funding
$97.2M
Key Investors
Sequoia Capital
2025-11-17Series B· $55M
2025-01-08Series A· $26.4M
2023-06-13Seed· $10.5M

Leadership Team

leader-logo
Timothy Barat
Co-Founder & CEO
linkedin
leader-logo
Abdulrahman Bin Omar
Chief Product Officer & Co-founder
linkedin
Company data provided by crunchbase