Principal Site Reliability Engineer jobs in United States

Palo Alto Networks · 1 day ago

Principal Site Reliability Engineer

Palo Alto Networks is a leading cybersecurity company dedicated to protecting the digital way of life through innovation and technology. The Principal Site Reliability Engineer will support services running on a large hybrid infrastructure, focusing on automation, architecture, and reliability while influencing cloud-native infrastructure at scale across GCP, AWS, and OCI.

Agentic AI · Cloud Security · Cyber Security · Network Security · Security
Growth Opportunities
No H1B

Responsibilities

Act as an architect for infrastructure owned by the team: plan ahead and design in line with scale requirements
Design, develop, and execute infrastructure components for the platforms owned by the team
Own Infrastructure as Code (IaC), Monitoring as Code (MaC), and Policy as Code (PaC) components, and build the golden path for future platforms with best practices
Strive for autonomy with an automation-first mindset, including modern AI-driven approaches
Redefine and continuously update modern CI/CD practices for cloud-native workloads
Perform on-call duties and reduce on-call toil through automation, AI agents, analyzers, and self-healing patterns
Support internal platform users as a forward-deployed engineer, close the feedback loop, and modernize the platform based on user needs
Maintain a security-first mindset without compromising reliability and operability
Design cost-effective infrastructure solutions across AWS, GCP, and OCI, including cost governance, capacity planning, and efficiency improvements

Qualifications

Kubernetes · Terraform · CI/CD infrastructure · GoLang · Python · CNCF tools · Cost governance · Troubleshooting · Communication skills · Self-motivated

Required

BS or MS in Computer Science, a related field, or equivalent professional experience
Expert knowledge of Kubernetes and CNCF ecosystem tools such as Helm, Prometheus, Backstage, Istio, and Crossplane
Strong mastery of Terraform: building reusable modules, designing complex infrastructure offerings operating in protected / restricted environments
Strong foundational knowledge of operating and scaling cloud-native workloads using tools such as KEDA, Karpenter, and NAP
Ability to architect CI/CD infrastructure for cloud-native workloads (primarily Golang and Python) and build DevSecOps pipelines
Programming skills in Golang and Python; scripting experience with Bash
Strong knowledge of Argo CD, including controlling and scaling thousands of deployments across Kubernetes and multiple clouds
Deep experience in cost governance and optimization at scale, including allocation models, anomaly detection, efficiency recommendations, and guardrails across cloud and Kubernetes workloads
Ability to diagnose and troubleshoot complex distributed systems handling high-volume transactions
Excellent written and verbal communication, able to collaborate and rally support
Self-disciplined, self-managed, and self-motivated, with a strong sense of ownership, urgency, and drive
Strong communication skills and the ability to partner across platform, security, and application engineering teams

Benefits

Restricted stock units
Bonus

Company

Palo Alto Networks

Palo Alto Networks is a cybersecurity company that offers cybersecurity solutions for organizations.

Funding

Current Stage
Public Company
Total Funding
$65M
Key Investors
Icon Ventures · Lehman Holdings · Globespan Capital Partners
2012-07-20 · IPO
2008-11-03 · Series C · $10M
2008-08-18 · Series C · $27M

Leadership Team

Helmut Reisinger
CEO, EMEA
Nikesh Arora
Chairman and CEO
Company data provided by Crunchbase