Abnormal Security · 2 days ago
Site Reliability Engineer II
Maximize your interview chances
Artificial Intelligence (AI)Cyber Security
H1B Sponsor Likely
Insider Connection @Abnormal Security
Get 3x more responses when you reach out via email instead of LinkedIn.
Responsibilities
Build tools and processes to standardize deployment of Abnormal Security product suite in a multi-datacenter setup.
Partner with R&D teams to develop pre and post deployment checklists, canary test environments and workflows, and safe rollback processes.
Identify gaps in existing processes and advocate for necessary changes to improve overall system stability and availability.
Lead the Production Readiness Review process to ensure the resilience of systems before customer deployment.
Oversee the Critical Change Management Review process for the safe application of changes to critical services.
Develop and enforce architecture guidelines to minimize downtime and ensure high system availability.
Establish consistent definition of metrics for 'Is this product working'.
Define and monitor SLAs/SLOs for critical systems, actively tracking deviations and triggering alerts when necessary.
Define incident severity classification guidelines and implement incident response protocols to promptly address issues and reduce downtime.
Facilitate effective communication between Engineering and Customer Success teams during incidents.
Design and implement tools to expedite system recovery and minimize the impact of incidents.
Develop guidelines for Post Mortems after incidents to prevent recurrence.
Qualification
Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.
Required
Bachelor’s in Computer Science, Computer Engineering, or equivalent professional experience
4+ experience as a Site Reliability Engineer, responsible for the reliability of shared services
Experience with a public cloud provider (AWS, Azure, GCP), observability stack (Prometheus, Grafana), and incident management tools (PagerDuty, Sentry, Slack integration)
Preferred
Experience with defining and implementing SRE practices such as Change Management, Production Readiness Review, and Incident Post Mortems
Experience with container orchestration, preferably Kubernetes and Helm
Experience developing Infrastructure as Code (IaC) modules and building automation, preferably Terraform
Benefits
Bonus
Restricted stock units (RSUs)
Benefits
Company
Abnormal Security
Abnormal Security is an email security company that protects enterprises and organizations from targeted email attacks.
H1B Sponsorship
Abnormal Security has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2023 (10)
2022 (11)
2021 (16)
2020 (3)
Funding
Current Stage
Late StageTotal Funding
$534MKey Investors
Wellington ManagementCrowdStrike Falcon FundInsight Partners
2024-08-06Series D· $250M
2023-03-29Series Unknown· undefined
2022-05-10Series C· $210M
Recent News
2024-11-22
2024-11-20
Crunchbase News
2024-10-10
Company data provided by crunchbase