Distinguished Engineer - Business Continuity, Governance, and Platform Resilience jobs in United States
cer-icon
Apply on Employer Site
company-logo

GEICO · 2 months ago

Distinguished Engineer - Business Continuity, Governance, and Platform Resilience

GEICO is a leading insurance company dedicated to providing quality coverage to millions of customers. They are seeking an experienced Distinguished Engineer to drive enterprise transformation by establishing engineering excellence focused on resilience, risk management, and technical governance.

Auto InsuranceFinancial ServicesGovernmentInsuranceInternetMobile
badNo H1Bnote

Responsibilities

Drive the technical BCDR strategy, ensuring it aligns with critical business and regulatory goals
Conduct comprehensive risk assessments, leading the architecture of highly resilient systems
Define organization-wide Recovery Time Objective (RTO) and Recovery Point Objective (RPO) metrics
Validate recovery targets by overseeing regular BCDR simulations and Chaos Engineering programs
Institutionalize technical excellence across the organization
Serve as a key leader within the Architecture Review Board, setting and enforcing architectural standards
Ensure all major technology investments are strategically aligned with business objectives and compliance requirements
Enforce domain consistency across architecture layers and drive strategic modernization efforts
Lead the SRE strategy by establishing and monitoring Service Level Objectives (SLOs) and error budgets
Develop and maintain comprehensive incident response plans, runbooks, and playbooks
Drive automation to achieve low Mean Time To Resolution (MTTR)
Analyze post-incident results to eradicate architectural flaws that drive down Mean Time Between Failures (MTBF)
Act as a trusted advisor to executive stakeholders on resilience and governance matters
Serve as a role model and mentor to coach senior and principal engineering talent
Analyze cost and forecast data, playing a critical role in strategic financial stewardship

Qualification

Site Reliability EngineeringBCDR StrategyDistributed Systems ArchitectureCloud ArchitectureIncident ManagementInfrastructure AutomationSQLNoSQL DatabasesVisionary ThinkingLeadership SkillsCommunication SkillsMentoring Skills

Required

Fluency and specialization in software development and best practices using modern programming languages
Deep knowledge of SRE practices, methodologies, and principles, along with a solid understanding of cloud-based compute, network, and storage technologies
Strong background in incident management (a core function of Case Management in platform operations), including the ability to create incident response playbooks, runbooks, and perform rigorous post-incident analysis to drive continuous improvement in reliability and availability
Expertise in distributed systems architecture, replication topologies, and distributed consistency patterns to meet stringent RTO and RPO requirements
Understanding of SQL and NoSQL databases, including stateful services management, storage, and optimization strategies for resilience and cloud cost efficiency
In-depth knowledge of hybrid cloud architecture, IaaS and PaaS technologies, container orchestration platforms (e.g., Kubernetes), and cloud efficiency
Experience with infrastructure automation, tooling, and configuration management frameworks (e.g., Ansible, Terraform)
Exceptional leadership and communication skills, with a passion for mentoring and fostering professional growth
Visionary thinker with the ability to anticipate future challenges and opportunities in resilience and governance
Proven track record of successfully leading, designing, and delivering complex engineering projects in large and complex organizations
12+ years of professional software development experience
10+ years of experience with architecture and design
6+ years of experience in open-source frameworks
6+ years of experience with AWS, GCP, Azure, or another cloud service
Bachelor's degree in computer science, Information Systems, or equivalent education or work experience

Benefits

Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being.
Financial benefits including market-competitive compensation; a 401K savings plan vested from day one that offers a 6% match; performance and recognition-based incentives; and tuition assistance.
Access to additional benefits like mental healthcare as well as fertility and adoption assistance.
Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year.

Company

GEICO, Government Employees Insurance Company, has been providing affordable auto insurance since 1936. It is a sub-organization of Berkshire Hathaway.

Funding

Current Stage
Late Stage
Total Funding
unknown
1996-01-01Acquired

Leadership Team

leader-logo
Todd Combs
Chairman, President, and Chief Executive Officer
leader-logo
Clayton Johnson
Sr. Director of Product Management
linkedin
Company data provided by crunchbase