Site Reliability Engineer Lead jobs in United States
cer-icon
Apply on Employer Site
company-logo

Patterson-UTI · 4 hours ago

Site Reliability Engineer Lead

Patterson-UTI is seeking a Site Reliability Engineer Lead to own and evolve the reliability, scalability, and operational excellence of cloud-native data platforms primarily on Google Cloud Platform (GCP). The role involves leading SRE practices, designing reliability metrics, and mentoring engineers while ensuring the operational integrity of data systems in the oilfield and energy environments.

Oil & Energy

Responsibilities

Lead SRE practices for GCP-based data platforms
Design and own SLIs, SLOs, error budgets, and reliability metrics
Build and maintain cloud-native observability (monitoring, logging, alerting)
Lead incident response for production cloud systems and drive postmortems
Partner with data engineering and platform teams to design reliable architectures
Automate operational workflows using Python
Drive improvements in CI/CD, infrastructure as code, and deployment safety
Mentor engineers and set SRE best practices across the team

Qualification

Google Cloud PlatformPythonInfrastructure as CodeKubernetesCI/CDProduction experienceObservabilityOilGas knowledgeEnglish proficiencyTechnical leadership

Required

7+ years in SRE, Cloud Platform Engineering, or DevOps
Strong hands-on experience with Google Cloud Platform, including: GCP: GKE, Compute Engine, Cloud Storage, Pub/Sub (or equivalents)
Cloud Monitoring & Logging
BigQuery
Dataflow
Datastream
IAM and networking
Composer/Airflow
Kubernetes: deployment, scaling, reliability patterns
CI/CD: GitHub Actions, GitLab CI, or similar
Observability: GCP Cloud Monitoring, Logging
Experience supporting cloud-native data systems (batch and streaming)
Production experience with Python for automation, tooling, or services
Infrastructure as Code experience (Terraform strongly preferred)
Experience operating systems in 24/7 production environments
Bachelor's degree in Business, Information Technology, Computer Science, or a related field
5+ years experience in Site Reliability Engineering, Cloud Platform Engineering, or DevOps
3+ years operating production workloads on Google Cloud Platform (GCP)
Prior technical leadership experience (lead engineer, tech lead, or ownership of reliability initiatives)
Ability to understand and speak English at a level of proficiency allowing employee to issue, receive and respond to both safety and operations-related directions in English

Preferred

Oil and Gas Industry knowledge
Technology/Digital Industry knowledge

Company

Patterson-UTI

twitter
company-logo
Patterson-UTI is a leading provider of drilling and completion services to oil and natural gas exploration and production companies in the United States and other select countries, including contract drilling services, integrated well completion services and directional drilling services in the United States, and specialized bit solutions in the United States, Middle East and many other regions around the world.

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Andy Hendricks
President & Chief Executive Officer | Board Director
linkedin
leader-logo
Mike Holcomb
Chief Operating Officer
linkedin
Company data provided by crunchbase