Palo Alto Networks · 2 months ago
Principal Site Reliability Engineer (Prisma AIRS)
Palo Alto Networks is committed to being the cybersecurity partner of choice and is seeking a Principal Site Reliability Engineer for their Prisma AIRS team. The role involves designing, building, and operating cloud-native applications while collaborating closely with software engineers and researchers to enhance AI security capabilities.
Agentic AICloud SecurityCyber SecurityNetwork SecuritySecurity
Responsibilities
Operate Prisma AIRS Cloud Services through contemporary Reliability Engineering practices
Design, Build, Operate and Secure Cloud-Native Microservice Applications at Global Scale
Own End-to-End Service Delivery in Production - Availability, Performance, Scalability, Security
Partner with Software & ML Engineers to design and build new capabilities and features
Banish toil through automation - from shell scripting to cluster orchestration to dynamic CI pipelines
Gain a deep understanding of how we deliver AI Security; you'll be able troubleshoot end-to-end a production issue from an inbound HTTP request, through the network, webserver, model inferencing, database, down to the hardware layer
Qualification
Required
You must be an expert in all things Kubernetes; you have a deep understanding of Kubernetes concepts, experience with building and operating production applications in multi-cluster environments, writing Helm charts from scratch and interacting with the Kubernetes API
You must be an expert in either GCP or AWS, with at least 5 years of experience building and operating production cloud infrastructure at scale
You must have significant Software Engineering / Development experience building applications in Go and/or Python
You should have demonstrated experience in network operations, such as cloud networking, network security, and/or distributed computing systems
You should have demonstrated experience in Linux administration, particularly in the context of cloud-native distributed systems, container runtimes, or Linux server fleets
You should have experience with Relational Databases and SQL; you know how to read, write and refactor SQL queries, identify opportunities for and design secondary indexes, manage database objects such as tables, views, stored procedures, and perform backup/restore operations
You should have experience designing, building and maintaining CI and/or GitOps pipelines for complex multi-application/multi-environment projects
You should have experience in building application observability through Prometheus / OpenTelemetry metrics, Structured Logging or Distributed Tracing systems
Preferred
You may have practical experience in Information Security, such as Cloud / Application / Network Security and are familiar with compliance programs such as SOC2, ISO/IEC 27001, PCI-DSS, FedRAMP or control frameworks such as MITRE ATT&CK, NIST 800-53, OWASP or others
You may have experience with running LLM / Machine Learning Inferencing Servers at scale across heterogeneous multi-GPU cloud environments
Benefits
Restricted stock units
Bonus
Company
Palo Alto Networks
Palo Alto Networks is a cybersecurity company that offers cybersecurity solutions for organizations.
H1B Sponsorship
Palo Alto Networks has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (579)
2024 (482)
2023 (341)
2022 (452)
2021 (493)
2020 (235)
Funding
Current Stage
Public CompanyTotal Funding
$65MKey Investors
Icon VenturesLehman HoldingsGlobespan Capital Partners
2012-07-20IPO
2008-11-03Series C· $10M
2008-08-18Series C· $27M
Recent News
2026-01-13
Jerusalem Post
2026-01-11
2026-01-09
Company data provided by crunchbase