Distinguished Engineer - Business Continuity, Governance, and Platform Resilience jobs in United States
cer-icon
Apply on Employer Site
company-logo

GEICO · 17 hours ago

Distinguished Engineer - Business Continuity, Governance, and Platform Resilience

GEICO is seeking an experienced Distinguished Engineer with a passion for building high-performance, low maintenance, zero-downtime platforms and applications. This role focuses on establishing engineering excellence with a specific emphasis on organizational resilience, strategic risk management, and rigorous technical governance.

Auto InsuranceFinancial ServicesGovernmentInsuranceInternetMobile
badNo H1Bnote

Responsibilities

Driving the technical BCDR strategy, ensuring it aligns with critical business and regulatory goals
Conducting comprehensive risk assessments, leading the architecture of highly resilient systems, and defining organization-wide Recovery Time Objective (RTO) and Recovery Point Objective (RPO) metrics
Validating recovery targets by overseeing regular BCDR simulations and Chaos Engineering programs
Setting and rigorously enforcing architectural standards, policies, and blueprints as a key leader within the Architecture Review Board
Ensuring that all major technology investments are strategically aligned with business objectives and compliance requirements
Enforcing domain consistency across architecture layers and driving strategic modernization efforts to maximize scalability and coherence
Leading the SRE strategy by establishing and monitoring Service Level Objectives (SLOs) and error budgets
Developing and maintaining comprehensive incident response plans, runbooks, and playbooks
Driving automation to achieve low Mean Time To Resolution (MTTR)
Analyzing post-incident results to eradicate architectural flaws that drive down Mean Time Between Failures (MTBF)
Acting as a trusted advisor to executive stakeholders on resilience and governance matters
Serving as a role model and mentor to coach senior and principal engineering talent
Analyzing cost and forecast data, playing a critical role in strategic financial stewardship, particularly in Cloud Spend Optimization

Qualification

Site Reliability EngineeringBCDR StrategyDistributed Systems ArchitectureCloud TechnologiesInfrastructure AutomationIncident ManagementVisionary ThinkingLeadership SkillsCommunication SkillsMentoring

Required

Fluency and specialization in software development and best practices using modern programming languages
Deep knowledge of SRE practices, methodologies, and principles, along with a solid understanding of cloud-based compute, network, and storage technologies
Strong background in incident management (a core function of Case Management in platform operations), including the ability to create incident response playbooks, runbooks, and perform rigorous post-incident analysis to drive continuous improvement in reliability and availability
Expertise in distributed systems architecture, replication topologies, and distributed consistency patterns to meet stringent RTO and RPO requirements
Understanding of SQL and NoSQL databases, including stateful services management, storage, and optimization strategies for resilience and cloud cost efficiency
In-depth knowledge of hybrid cloud architecture, IaaS and PaaS technologies, container orchestration platforms (e.g., Kubernetes), and cloud efficiency
Experience with infrastructure automation, tooling, and configuration management frameworks (e.g., Ansible, Terraform)
Exceptional leadership and communication skills, with a passion for mentoring and fostering professional growth
Visionary thinker with the ability to anticipate future challenges and opportunities in resilience and governance
Proven track record of successfully leading, designing, and delivering complex engineering projects in large and complex organizations
12+ years of professional software development experience
10+ years of experience with architecture and design
6+ years of experience in open-source frameworks
6+ years of experience with AWS, GCP, Azure, or another cloud service
Bachelor's degree in computer science, Information Systems, or equivalent education or work experience

Benefits

Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being.
Financial benefits including market-competitive compensation; a 401K savings plan vested from day one that offers a 6% match; performance and recognition-based incentives; and tuition assistance.
Access to additional benefits like mental healthcare as well as fertility and adoption assistance.
Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year.

Company

GEICO, Government Employees Insurance Company, has been providing affordable auto insurance since 1936. It is a sub-organization of Berkshire Hathaway.

Funding

Current Stage
Late Stage
Total Funding
unknown
1996-01-01Acquired

Leadership Team

leader-logo
Todd Combs
Chairman, President, and Chief Executive Officer
leader-logo
Clayton Johnson
Sr. Director of Product Management
linkedin

Recent News

Bizjournals.com Feed (2025-11-12 15:43:17)
Beinsure - Insurance, Reinsurance, InsurTech Insights
Company data provided by crunchbase