Availity · 2 days ago
Platform Engineer IV (EKS & Observability)
Availity is a leading healthcare engagement platform, transforming the healthcare landscape with powerful tools and insights. As a Platform Engineer IV, you will manage the infrastructure backbone of U.S. healthcare transactions, focusing on Kubernetes, observability, and platform services.
FitnessHealth CareHospital
Responsibilities
Own and evolve our Kubernetes (EKS/Istio) control plane at enterprise scale
Lead the tooling and support for observability and logging (New Relic, Splunk, Cribl, OpenTelemetry) with reliability as your north star
Oversee our EC2 application deployment pipelines and other mission-critical internal platforms in our AWS private cloud
Guide and mentor engineers while setting the bar for operational excellence
Provide technical leadership for the infrastructure engineering and operations team focused on Kubernetes, observability, and platform services
Owning and advancing the Kubernetes/EKS control plane, Istio service mesh, and related networking/security features (mTLS, SSL/TLS)
Managing observability and logging platforms including:
Splunk (EKS + on-prem components, forwarders, deployment server)
Cribl operational pipelines (EKS-based)
New Relic SaaS integrations and Prometheus data ingestion
OpenTelemetry & KubeLogging/Banzai Operator for distributed tracing and logging pipelines
Prometheus/Grafana migrations from on-prem OCP to AWS for metrics scraping and synthetic monitoring
Overseeing EC2 application deployment pipelines for packaged software (commercial and open source) platforms hosted in AWS, including replatforming efforts away from EL7 to cloud-native solutions
Supporting legacy/on-prem platforms as they migrate into AWS (Tidal, Aries pipelines, provider Splunk, legacy base images)
Driving infrastructure-as-code practices (Terraform, Helm, Ansible) for repeatable deployments and environment consistency
Collaborating with engineering, middleware, and product teams to define clear ownership, reduce friction, and ensure platform services enable—not block—delivery
Ensuring upgrades, patching, and platform updates are proactively planned and executed without business disruption
Setting reliability targets and defining operational metrics (availability, latency, error budgets) in line with SRE methodologies
Qualification
Required
Bachelor's degree in computer science or related field, or equivalent work experience
7-10 years of relevant technical and business experience in IT systems delivery, operations, and support (preferably in healthcare or high-transaction environments)
3+ years of experience leading technical engineering efforts involving implementation and management of IT systems
Hands-on expertise with Kubernetes/EKS administration at scale
Hands-on expertise with Terraform, Helm, and AWS services (VPC, IAM, EC2, EKS, Istio)
Hands-on expertise with observability and monitoring tools: Splunk, Cribl, Prometheus/Grafana, OpenTelemetry, New Relic
Hands-on expertise with Linux (RHEL-based) systems administration, including SELinux
Experience bridging infrastructure and development teams, ensuring alignment of roadmaps and goals
Strong leadership skills with the ability to motivate and guide technical teams
Excellent communication skills, with the ability to explain complex technical concepts to both technical and non-technical stakeholders
Preferred
SaaS experience supporting large-scale, mission-critical systems
Familiarity with EC2 application deployment pipelines for packaged software (commercial and open source) and re-platforming to cloud-native environments
Knowledge of service mesh concepts (Istio, Linkerd, etc.)
Background in metrics-driven reliability engineering (SLOs, SLIs, error budgets)
Experience with scripting/programming (JavaScript for Cribl, Python, etc.)
Benefits
Generous HSA company contribution
Healthcare
Vision
Dental benefits
401k match program
Unlimited PTO for salaried associates + 9 paid holidays
Reimburse up to $250/year for gym memberships, participation in racing events, weight management programs, etc.
Education reimbursement
Paid Parental Leave for both moms and dads, both birth parents and adoptive parents
Company
Availity
Availity offers a free access to real-time information and instant responses for healthcare professionals.
Funding
Current Stage
Late StageTotal Funding
$200MKey Investors
Novo HoldingsFrancisco Partners
2021-07-07Secondary Market
2017-10-19Private Equity· $200M
Recent News
MedCity News
2025-11-19
2025-10-22
Company data provided by crunchbase