Lead Site Reliability Engineer @ Henry Meds | Jobright.ai
JOBSarrow
RecommendedLiked
0
Applied
0
Lead Site Reliability Engineer jobs in United States
109 applicants
company-logo

Henry Meds · 1 week ago

Lead Site Reliability Engineer

Wonder how qualified you are to the job?

Digital MediaHealth Care
check
Actively Hiring

Insider Connection @Henry Meds

Discover valuable connections within the company who might provide insights and potential referrals, giving your job application an inside edge.

Responsibilities

Architect and create our observability and monitoring system.
Create a disaster recovery plan and facilitate disaster recovery testing. Familiarity with DiRT exercises is a plus.
Oversee teams who are responsible for the design, architecture, and development of operational infrastructure within our platform.
Assist in hiring to perform daily operations and embed SRE operations across the department.
Provide architectural and technical guidance and mentorship to SRE teams, fostering skill development, and building strong and capable SRE practices.
Lead and prioritize multiple projects, create roadmaps, and drive implementation plans.
Partner with product and engineering stakeholders to proactively identify operational needs and deliver solutions.

Qualification

Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.

GCPDisaster RecoveryIdentity Access ManagementSecurity ManagementIncident ManagementPerformance ManagementCapacity PlanningTroubleshootingMonitoringLogging StandardsPerformance TestingChaos EngineeringResilience TestingInfrastructure as CodeTerraformSREAWSPaaSIaaSObservabilityDatadogGrafanaOpsGeniePagerDutyCloud MonitoringRedundancyConfiguration ManagementNetworking TechnologiesIAM ModelAutomation Solutions

Required

Experience in GCP working with stakeholders to develop and document resilient services, across multiple edge and availability zones, with documented comprehensive disaster recovery plans and regularly conduct drills and exercises to test and validate the effectiveness of these plans
Experience managing identity and access management to control resources and services in GCP and work with stakeholders to develop security practices and procedures to ensure compliance with industry best practices and regulations
Experience managing the security and monitoring systems in our cloud that ensure our systems health
Experience leading incident management processes, conducting post-mortems, and driving improvements to prevent future incidents
Experience setting up availability expectations, addressing performance issues, uncovering observability gaps, leading problem management, and driving capacity planning
The ability to manage cloud operations, installing, maintaining, and monitoring network resources
Experience Defining SLOs, SLIs, leads on-call support schedules, troubleshooting, building support playbooks, implementing monitoring and alerting, logging standards, and conducting performance testing
Experience creating playbooks utilizing a chaos engineering mindset and resilience testing
Experience architecting Infrastructure As Code using Terraform
2+ years of leading Cloud SRE teams across AWS and Google Cloud Platform
5+ years of hands-on experience with infrastructure design and deployment utilizing Cloud PaaS and IaaS cloud offerings
5+ years of experience in cloud and system observability (Datadog, Grafana, Cloud Profiler) and alerting (OpsGenie, PagerDuty, GCP Cloud Monitoring)
5+ years of experience architecting and building infrastructure with a focus on redundancy, reliability, disaster response and discovery
5+ years of configuration/management experience with Cloud networking technologies (GCP IAM model, Terraform, gcloud-cli)
5+ years of cloud Operations knowledge with automation solutions
5+ years of cloud Solutions (Google Cloud Platform), Cloud Run, Containers, Terraform, GCS, C#, TypeScript

Preferred

10 + years of overall in a DevOps or Site Reliability Engineer environment

Benefits

Platinum PPO Healthcare + Vision & Dental
401(k) with matching contributions
Unlimited PTO
Fully remote position with occasional travel

Company

Henry Meds

twittertwitter
company-logo
Henry is a platform that empowers clinicians to deliver affordable treatment for chronic medical conditions.

Funding

Current Stage
Early Stage
Total Funding
unknown
Key Investors
Heron Rock Fund
2022-10-14Seed· Undisclosed
2022-01-28Pre Seed· Undisclosed

Leadership Team

leader-logo
Steven Peacock
Chief Medical Officer
linkedin
Company data provided by crunchbase
logo

Orion

Your AI Copilot