Director, Site Reliability Engineering jobs in United States

Barracuda · 2 hours ago

Director, Site Reliability Engineering

Barracuda is dedicated to making the world a safer place with its cloud-enabled security solutions. The company is looking for a strategic and visionary Director of Site Reliability Engineering to lead global reliability initiatives across its SaaS portfolio, overseeing a distributed team and collaborating with various departments to ensure high availability and performance of its platforms.
Cloud Infrastructure, Enterprise Software, Security, Software
H1B Sponsor Likely

Responsibilities

Define and execute Barracuda’s global SRE strategy, aligning reliability goals with business objectives and customer SLAs
Drive continuous improvement in availability, latency, performance, and cost optimization across all cloud services
Implement AI-driven observability and anomaly detection for proactive incident prevention
Deploy agentic automation systems to manage routine operational tasks, optimize cloud resources, and accelerate remediation workflows
Explore LLM-based runbooks and autonomous agents for incident triage and root cause analysis
Partner with Engineering, Security, and FinOps teams to embed reliability into product design and delivery pipelines
Influence architectural decisions for reliability, disaster recovery, and observability systems; ensure compliance with security and regulatory standards
Champion Infrastructure-as-Code and CI/CD automation at scale using Terraform, CloudFormation, GitHub Actions, and Jenkins
Facilitate incident response protocols, conduct executive-level postmortems, and implement proactive risk mitigation strategies
Define and enforce SLIs and SLOs across global services; report reliability metrics to executive leadership
Build and mentor a high-performing SRE organization; foster a culture of ownership, innovation, and collaboration across regions
Lead initiatives for cost governance and performance tuning in AWS and Azure environments
Present reliability roadmaps, KPIs, and risk assessments to senior leadership and stakeholders

Qualifications

Cloud Operations, AWS, Azure, AI-driven Automation, Infrastructure as Code, CI/CD Automation, Kubernetes, Observability Tools, Python, Certifications, Leadership Skills

Required

12+ years in infrastructure, cloud operations, or SRE roles, including 5+ years in leadership positions managing distributed teams
Deep knowledge of AWS and Azure architectures, security, and operations in large-scale SaaS environments
Experience implementing AI-driven observability, predictive analytics, and autonomous remediation systems
Proven success implementing Infrastructure as Code (such as Terraform or CloudFormation) at enterprise scale
Advanced experience with GitHub Actions, Jenkins, and deployment strategies (blue/green, canary, rolling)
Expertise in Kubernetes (EKS, AKS) and containerized workloads
Strong background in Prometheus, Grafana, ELK, and APM tools; experience designing self-healing systems
Proficiency in Python, Go, or similar languages for automation and tooling
Exceptional ability to lead globally distributed teams, influence cross-functional stakeholders, and drive cultural change

Preferred

AWS Solutions Architect/DevOps Professional and Kubernetes certifications (CKA, CKAD)

Benefits

High-quality health benefits
Retirement plan with employer match
Flexible time off

Company

Barracuda

Barracuda is a leading global cybersecurity company providing complete protection against complex threats for businesses of all sizes.

H1B Sponsorship

Barracuda has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2025 (7)
2024 (6)
2023 (10)
2022 (13)
2021 (12)
2020 (9)

Funding

Current Stage: Late Stage
Total Funding: $61M
Key Investors: Menlo Ventures, Palomar Ventures, Daiwa Securities Group, NIF Ventures
2007-09-17: Acquired
2005-10-17: Series Unknown · $15M
2003-04-08: Series C · $20M

Leadership Team

Hatem Naguib
Chief Executive Officer
Fleming Shi
Chief Technology Officer
Company data provided by Crunchbase