Okta · 4 months ago
Staff Site Reliability Engineer
Okta is The World’s Identity Company, specializing in secure access and authentication solutions. They are seeking a highly skilled Staff Site Reliability Engineer to design, build, and maintain reliable and scalable infrastructure for their security SaaS offerings while collaborating with various teams to ensure compliance and security.
CRMEnterprise SoftwareIdentity ManagementIT InfrastructureManagement Information SystemsSoftwareWeb Development
Responsibilities
Design, build, and maintain the core infrastructure that underpins our security SaaS offerings, ensuring high availability, performance, and scalability. This includes building and operating the tooling for our Snowflake data systems
Develop robust automation using code to eliminate toil and ensure consistency across our environments. You'll be a key driver in automating everything from infrastructure provisioning to application deployment and incident response
Work closely with our security teams to embed a security-first mindset into all our processes and infrastructure. You will be responsible for ensuring our systems and data platforms are compliant with industry standards
Participate in on-call rotations and be a primary responder for critical incidents, leading root cause analysis and implementing preventative measures to ensure issues don't recur
Partner with development, data science, and security teams to provide expert guidance on architectural decisions, best practices, and the implementation of new services
Qualification
Required
Strong Coding Skills: You are a developer at heart and are comfortable writing production-level code to solve complex operational challenges
Infrastructure as Code (IaC): Deep experience with Terraform for provisioning and managing cloud infrastructure and services
Continuous Delivery: Familiarity with modern CI/CD practices and tools, particularly Spinnaker, to automate and standardize our release pipelines
Containerization & Orchestration: Expertise in container technologies and hands-on experience managing large-scale, production-ready clusters with Kubernetes
Database Migrations: Experience with database schema management tools like Flyway for safely and reliably handling database changes
Data Systems: Direct experience with large-scale data systems, specifically with the Snowflake platform
Problem-Solving: Excellent analytical and problem-solving skills with a proactive approach to identifying and addressing potential issues
This position requires the ability to access federal environments and/or have access to protected federal data. As a condition of employment for this position, the successful candidate must be able to submit documentation establishing U.S. Person status (e.g. a U.S. Citizen, National, Lawful Permanent Resident, Refugee, or Asylee. 22 CFR 120.15) upon hire
This role requires in-person onboarding and travel to our San Francisco, CA HQ office or our Chicago Office during the first week of employment
Preferred
AI/ML Experience (a plus): Experience or a strong interest in AI/ML, particularly how these technologies can be applied to improve reliability, security, and operational efficiency (e.g., AIOps, predictive analysis)
Benefits
Health, dental and vision insurance
401(k)
Flexible spending account
Paid leave (including PTO and parental leave)
Company
Okta
Okta is a management platform that secures critical resources from cloud to ground for workforce and customers.
Funding
Current Stage
Public CompanyTotal Funding
$1.23BKey Investors
Sequoia CapitalAndreessen Horowitz
2020-06-08Post Ipo Equity· $1B
2017-04-06IPO
2017-03-30Secondary Market
Recent News
2026-01-06
Company data provided by crunchbase