Senior Site Reliability Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

HCA Healthcare · 20 hours ago

Senior Site Reliability Engineer

HCA Healthcare is a leading organization in the healthcare industry that values its team members and invests in their development. They are seeking a Senior Site Reliability Engineer to provide best practices for mission-critical applications, enhance system reliability, and drive uptime across the enterprise.

BiotechnologyHealth CareHospitalMedicalPrimary and Urgent Care
check
H1B Sponsor Likelynote

Responsibilities

Practices and adheres to the “Code of Conduct” philosophy and “Mission and Value Statement
Promote a collaborative team environment and work closely with colleagues to achieve business objectives
Collaborate with stakeholders (e.g., business stakeholders, product owners, project managers, and end users) to understand functional and non-functional requirements
Lead Investigations and solution proposals to development and design problems
Participate with team members in scope of work estimation and forecasting
Improve performance of existing software by diagnosing and resolving critical issues
Prepare technical documentation, including software & architectural design evaluation plans, data flow diagrams, test results, and technical manuals
Adhere to and influence established development practices and processes
Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding
Ongoing review of technology, infrastructure, and code to enhance and build resiliency into the applications
Create sustainable systems and services through automation and uplifts
Balance feature development & deployments with speed, reliability, and well-defined service-level objectives
Partner with development teams and vendors of 3rd party applications to improve services through rigorous testing and release procedures
Build/Develop automations to “self-heal” applications and reduce the toil of manual operational tasks. Pursuit of operational excellence, uptime, and reliability of our applications
Participate, lead, and drive in creating postmortem analysis of why services broke or degraded, including recommendations for long-term fixes. It may require going across multiple teams and organizations within the enterprise. Determine root-cause for all production-level incidents and write corresponding high-quality RCA reports
Collaborating and building relationships across business and technology organizations, providing sound analysis, and thought leadership
Support system upgrades, architecture design, implementations, and deployments
Ability to work in a complex organization, navigate multiple verticals of expertise and negotiate, guide direct and influence your peers to provide real solutions
Maintain industry knowledge in software development, architecture, and development products, such as databases, security, and automation products

Qualification

Site Reliability EngineeringCloud PlatformsAutomationSystem ArchitectureTerraformCI/CD PipelinesVersion Control (Git)Coding LanguagesLearning MindsetCommunicationProblem SolvingCollaboration

Required

Bachelor's degree Computer Science or related field preferred
5+ years of experience Engineering roles required
A proactive approach to spotting problems, areas for improvement, and performance bottlenecks required
Be a creative thinker, not bound by 'the way things have always been done'. What you know is less important than how well you learn and innovate. We don't need engineers who know all the answers; we need engineers who can invent the answers no one has thought of yet, to the questions yet to be asked required
Experienced in helping define SLIs, SLOs & SLOs, and the experience to build observability to report on operating against those objectives required
Strong ability to communicate complex technical information in a condensed manner to various stakeholders verbally and in writing required
Ability to build and maintain strong cross-functional partnerships at all levels of the organization required
Ability to work, make aligned decisions, plan, and accomplish goals without explicit direction/guidance from leadership required
Experience with system architecture, how software systems interact, and integrate required
Ability to evaluate new technologies to assist senior leadership align it to the HCA Healthcare strategic roadmap required
Strong understanding of SRE practices and implementations required
Expertise in knowledge of Linux and Windows Systems Administration and how to manage through code required
Ability to determine best practices and articulate authoritative direction required
Ability to help establish and grow the SRE principles with the team required
Growth mindset and a willingness to learn new skills, technologies, and frameworks required

Preferred

AZ 104, Terraform preferred
Knowledge of infrastructure, frameworks, and software/cloud design patterns for implementing applications in the cloud. Preferred
Experience in the use and implementation of relevant tools and platforms (e.g., cloud platforms (IaaS and PaaS), web technologies, client-server technologies, continuous integration, and deployment) preferred
Experience with version control (Git) and open-source practices preferred
Experience in one or more coding languages. (JavaScript/Typescript, C#, Python, Java, Swift or Kotlin) preferred
Experience with automation of CI/CD pipelines preferred
Experience with IaC such as Terraform preferred
Strong: Learning and teaching other team members and others external to the team preferred

Benefits

Comprehensive medical coverage that covers many common services at no cost or for a low copay. Plans include prescription drug and behavioral health coverage as well as free telemedicine services and free AirMed medical transportation.
Additional options for dental and vision benefits, life and disability coverage, flexible spending accounts, supplemental health protection plans (accident, critical illness, hospital indemnity), auto and home insurance, identity theft protection, legal counseling, long-term care coverage, moving assistance, pet insurance and more.
Free counseling services and resources for emotional, physical and financial wellbeing
401(k) Plan with a 100% match on 3% to 9% of pay (based on years of service)
Employee Stock Purchase Plan with 10% off HCA Healthcare stock
Family support through fertility and family building benefits with Progyny and adoption assistance.
Referral services for child, elder and pet care, home and auto repair, event planning and more
Consumer discounts through Abenity and Consumer Discounts
Retirement readiness, rollover assistance services and preferred banking partnerships
Education assistance (tuition, student loan, certification support, dependent scholarships)
Colleague recognition program
Time Away From Work Program (paid time off, paid family leave, long- and short-term disability coverage and leaves of absence)
Employee Health Assistance Fund that offers free employee-only coverage to full-time and part-time colleagues based on income.

Company

HCA Healthcare

company-logo
HCA Healthcare provides medical education and healthcare services in locally managed facilities. It is a sub-organization of North Florida Endoscopy Center.

H1B Sponsorship

HCA Healthcare has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (2)
2022 (2)
2020 (1)

Funding

Current Stage
Public Company
Total Funding
$8.51B
2025-10-31Post Ipo Debt· $3.25B
2025-02-24Post Ipo Debt· $5.25B
2014-06-25Post Ipo Debt· $3.2M

Leadership Team

leader-logo
Nicholas Manning
Chief Executive Officer
linkedin
leader-logo
Nick Lane
Regional Vice President Human Resources
linkedin
Company data provided by crunchbase