Senior Machine Learning Infrastructure Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Abridge · 2 months ago

Senior Machine Learning Infrastructure Engineer

Abridge is a pioneering company in healthcare technology, focused on enhancing medical conversations through AI. As a Senior Machine Learning Infrastructure Engineer, you will be responsible for building and optimizing the infrastructure that supports machine learning models, ensuring scalability and efficiency in AI-driven solutions.

Artificial Intelligence (AI)Health CareIntelligent SystemsMachine LearningMedical
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Design, deploy and maintain scalable Kubernetes clusters for AI model inference and training
Develop, optimize, and maintain ML model serving and training infrastructure, ensuring high-performance and low-latency
Collaborate with ML and product teams to scale backend infrastructure for AI-driven products, focusing on model deployment, throughout optimization, and compute efficiency
Optimize compute-heavy workflows and enhance GPU utilization for ML workloads
Build a robust model API orchestration system
Collaborate with leadership to define and implement strategies for scaling infrastructure as the company grows, ensuring long-term efficiency and performance

Qualification

Kubernetes administrationMachine learning modelsDistributed systems architectureAPI developmentGPU optimizationInfrastructure as codeCommunication

Required

Strong experience in building and deploying machine learning models in production environments
Deep understanding of container orchestration and distributed systems architecture
Expertise in Kubernetes administration, including custom resource definitions, operators, and cluster management
Experience developing APIs and managing distributed systems for both batch and real-time workloads
Excellent communication skills, with the ability to interface between research and product engineering

Preferred

Expertise with model serving frameworks such as NVIDIA Triton Server, VLLM, TRT-LLM and so on
Expertise with ML toolchains such as PyTorch, Tensorflow or distributed training and inference libraries
Familiarity with GPU cluster management and CUDA optimization
Knowledge of infrastructure as code (Terraform, Ansible) and GitOps practices
Experience with container registries, image optimization, and multi-stage builds for ML workloads
Experience orchestrating across ASR models or LLM models for building various GenAI applications

Benefits

Generous Time Off : 13 paid holidays, flexible PTO for salaried employees, and accrued time off for hourly employees.
Comprehensive Health Plans : Medical, Dental, and Vision plans for all full-time employees. Abridge covers 100% of the premium for you and 75% for dependents. If you choose a HSA-eligible plan, Abridge also makes monthly contributions to your HSA.
Paid Parental Leave : 16 weeks paid parental leave for all full-time employees.
401k and Matching : Contribution matching to help invest in your future.
Pre-tax Benefits: Access to Flexible Spending Accounts (FSA) and Commuter Benefits.
Learning and Development Budget : Yearly contributions for coaching, courses, workshops, conferences, and more.
Sabbatical Leave : 30 days of paid Sabbatical Leave after 5 years of employment.
Compensation and Equity : Competitive compensation and equity grants for full time employees.

Company

Abridge

twittertwittertwitter
company-logo
Abridge is an AI-driven platform that transforms patient-clinician conversations into structured clinical notes for healthcare industries.

H1B Sponsorship

Abridge has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (9)
2024 (1)
2022 (1)

Funding

Current Stage
Late Stage
Total Funding
$757.5M
Key Investors
Spark CapitalIKS HealthWittington Ventures
2025-06-24Series E· $300M
2025-02-17Series D· $250M
2024-02-23Series C· $150M

Leadership Team

leader-logo
Jonathan Lydon
VP People
linkedin
Company data provided by crunchbase