
Zefr · 9 hours ago

Principal Site Reliability Engineer

Zefr is the leading global technology company enabling responsible marketing in walled garden social environments. As a Principal Site Reliability Engineer, you will serve as a technical leader and subject matter expert, defining the technical vision and shaping the direction of reliability practices across the organization.

Advertising · Enterprise Software · Internet · Marketing · Video · Video Advertising
Work & Life Balance
H1B Sponsor Likely

Responsibilities

Support and build systems and tools that enable other engineers to generate, deploy, and manage product features and models both quickly and safely
Deploy and support a multi-cloud, microservice architecture, including infrastructure tailored for ML workloads, deployed via GitHub Actions, Argo CD, and Kubernetes
Collaborate with other engineers to architect secure, resilient, scalable, and cost-efficient applications and ML systems/pipelines in AWS and GCP
Foster and push our DevOps culture and philosophy by encouraging continuous improvement across all engineering teams
Proactively maintain the health of production environments, including monitoring application performance and resource utilization
Participate in a 24/7 on-call rotation and respond to system performance issues and outages
Debug code at the application and infrastructure level
Mature our CI/CD workflows and release process
Maintain a forward-thinking approach by actively researching and proposing new solutions
Propose and review engineering Requests for Comments (RFCs) to drive engineering architecture and practices

Qualifications

Cloud Infrastructure · Kubernetes · CI/CD · Observability · Python · Terraform · GitOps · Incident Management · Continuous Improvement · Technical Leadership · Communication Skills · Mentoring

Required

10+ years of experience designing, managing, deploying, and supporting cloud infrastructure in a production environment using major public cloud providers (GCP experience is a huge bonus)
Experience in Advertising or AdTech
Demonstrated technical leadership experience, including mentoring engineers, driving cross-functional projects, and influencing architectural decisions at an organizational level
Knowledge of GitOps, including an understanding of modern CI/CD pipelines, techniques, and technologies (GitHub Actions, GitLab, CircleCI, Argo CD, Flux)
Advanced proficiency with IaC and configuration management tools (Terraform, Terragrunt, OpenTofu, Crossplane, Pulumi)
Deep production experience architecting, managing, deploying, and supporting container-based workloads on Kubernetes clusters
Proven track record of building and scaling reliability practices, including SLO/SLI frameworks, incident management, and capacity planning
Extensive production experience with observability platforms and practices (Prometheus, Grafana, Chronosphere, Datadog, OpenTelemetry); ability to design monitoring strategies for complex distributed systems
Strong knowledge of cloud networking (mesh, NAT, load balancers, API gateways, proxies, etc.), cloud security, and cost optimization strategies
Exceptional written and verbal communication skills; ability to translate complex technical concepts for diverse audiences and build consensus across teams
Experience authoring technical strategy documents, RFCs, and architectural proposals

Benefits

Flexible PTO
Medical, dental, and vision insurance with FSA options
Company-paid life insurance
Paid parental leave
401(k) with company match
Professional development opportunities
13 paid holidays
Summer Fridays (we leave early)
In-office, hybrid, and fully-remote work options available
In-office lunches and lots of free food
Optional in-person and virtual events (we like to celebrate!)

Company

Zefr is a technology company that uses artificial intelligence to deliver digital advertising services.

H1B Sponsorship

Zefr has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data from the US Department of Labor)
Trends of Total Sponsorships
2025 (5)
2024 (3)
2023 (4)
2022 (10)
2021 (9)
2020 (2)

Funding

Current Stage: Late Stage
Total Funding: $65.06M
Key Investors: IVP, U.S. Venture Partners, MK Capital
2016-03-09 · Series E · $5M
2014-02-26 · Series D · $30M
2012-08-15 · Series C · $18.5M

Leadership Team

Richard Raddon · Co-Founder, Co-CEO
Zach James · Co-Founder & Co-CEO
Company data provided by Crunchbase