Fortinet · 1 month ago
Principal Site Reliability Engineer
Fortinet is a global leader in cybersecurity, dedicated to securing clouds and container environments for B2B customers. The Principal Site Reliability Engineer will lead the design, implementation, and optimization of scalable and resilient platform infrastructure, driving strategic initiatives to enhance operational excellence and mentoring teams across the organization.
Cyber SecurityMobileNetwork SecuritySecurity
Responsibilities
Architect and implement advanced automation strategies to maximize operational efficiency and minimize toil across the FortiCNAPP platform
Lead the design, development, and enhancement of infrastructure systems to ensure world-class scalability, resiliency, and performance
Proactively identify and resolve complex, systemic issues through innovative automation, tooling, and architectural solutions, preventing customer-facing incidents
Drive the evolution of monitoring, instrumentation, and observability systems to anticipate and mitigate scalability and reliability risks before they impact customers
Champion company-wide adoption of reliability best practices, establishing key metrics, SLAs, and milestones to embed scalability and resiliency into all engineering processes
Collaborate with cross-functional teams to define and implement industry-leading practices for infrastructure, deployment, and operational workflows
Provide technical leadership and mentorship to engineering and operations teams, fostering a culture of reliability, automation, and continuous improvement
Lead incident response and post-mortem processes, driving root cause analysis and implementing preventive measures
Participate in an on-call rotation, serving as an escalation point for complex issues and guiding the team through critical incidents
Influence strategic technology decisions, evaluating and integrating cutting-edge tools, services, and methodologies to enhance platform reliability
Qualification
Required
10+ years of DevOps/SRE experience, with at least 5 years in a senior or lead role managing production systems at scale
Expert-level development and automation skills, with a proven track record of building sophisticated tools and workflows
Deep expertise in Infrastructure as Code (e.g., Terraform) and supporting tools (e.g., Atlantis, ArgoCD, Flux)
Advanced experience with Kubernetes and its ecosystem (e.g., Helm, operators, Kustomize), including managing large-scale, production-grade clusters
Extensive experience with multiple cloud providers and managed services (e.g., AWS: EKS, EC2, S3, RDS, Secrets Manager; GCP, Azure)
Proven ability to architect and operate highly reliable, fault-tolerant cloud infrastructure that supports rapid microservice deployment with robust monitoring and high availability
Exceptional cross-team communication and leadership skills, with experience driving alignment across engineering, product, and operations teams
Deep knowledge of large-scale system building blocks, including load balancing, distributed/cloud computing, container orchestration, and advanced monitoring/observability
Expert understanding of cloud networking, including VPC configuration, cross-cloud connectivity, and hybrid cloud architectures
Proficiency in one or more programming languages (e.g., Python, Go, Rust) for building tools and automation frameworks
Preferred
Extensive experience designing and implementing advanced monitoring and observability systems (e.g., Prometheus, Grafana, New Relic, Datadog, OpenTelemetry)
Strong advocate for 'everything as code' principles, with experience institutionalizing IaC and GitOps practices across teams
Deep expertise in Java application servers, JVM tuning, and performance optimization for high-throughput systems
Experience leading cross-functional initiatives to improve system reliability, such as chaos engineering, disaster recovery planning, or zero-downtime deployments
Benefits
Medical
Dental
Vision
Life and disability insurance
401(k)
11 paid holidays
Vacation time
Sick time
Comprehensive leave program
Company
Fortinet
Fortinet is a provider of network security appliances that include firewalls, security gateways, and complementary products. It is a sub-organization of Fortinet Federal.
H1B Sponsorship
Fortinet has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (200)
2024 (152)
2023 (155)
2022 (175)
2021 (139)
2020 (161)
Funding
Current Stage
Public CompanyTotal Funding
$89MKey Investors
Meritech Capital PartnersDEFTA Partners
2009-11-18IPO
2004-03-03Series Unknown· $50M
2003-08-29Series D· $30M
Recent News
2026-01-11
dcm.com
2025-12-31
Company data provided by crunchbase