
The Walt Disney Company · 7 hours ago

Sr ML Ops Engineer

The Walt Disney Company is seeking a highly skilled Sr ML Ops Engineer to build and maintain the infrastructure powering its machine learning and AI frameworks. The role enables seamless workflows for model training, retraining, and deployment, and ensures that AI solutions operate reliably at scale.

Amusement Park and Arcade, Animation, Consumer Goods, Digital Media, E-Commerce, Media and Entertainment, Multi-level Marketing, Performing Arts, Resorts
H1B Sponsor Likely

Responsibilities

Develop, deploy, and maintain scalable infrastructure for machine learning model training, retraining, and inference
Design and optimize CI/CD pipelines specifically tailored for machine learning workflows, ensuring efficient delivery from research to production
Implement robust monitoring and logging systems to track model performance and identify potential issues in production environments
Collaborate with AI researchers and data scientists to ensure infrastructure aligns with project requirements and supports iterative experimentation
Manage compute resources (cloud and on-premises) to enable large-scale distributed training and inference tasks
Containerize machine learning models and applications using Docker and deploy them via Kubernetes or equivalent orchestration systems
Automate deployment workflows for serving ML models using frameworks such as TorchServe, TensorFlow Serving, and FastAPI
Implement model versioning, rollback strategies, and governance for maintaining production stability
Optimize cost efficiency and performance of machine learning workflows in cloud environments such as AWS, GCP, or Azure
Stay updated with emerging ML Ops tools and practices, integrating them into existing workflows to improve performance and reliability

Qualifications

ML Ops, CI/CD pipelines, Docker, Kubernetes, Cloud infrastructure, TensorFlow Serving, TorchServe, Python, Data tracking tools, Scripting skills, Security best practices, Collaboration

Required

Bachelor's degree in Computer Science, Engineering, or a related field
5+ years of experience in DevOps, Site Reliability Engineering, or a related role, with at least 2 years focused on ML Ops
Expertise in building and maintaining CI/CD pipelines for machine learning applications
Strong proficiency with containerization (Docker) and orchestration tools (Kubernetes)
Proficiency in deploying machine learning models using frameworks such as TensorFlow Serving, TorchServe, or custom APIs
Deep understanding of cloud infrastructure and services (AWS, GCP, or Azure) for ML workloads, including GPU and TPU utilization
Experience managing large-scale distributed training workflows and optimizing resource allocation
Familiarity with tools like MLflow, DVC, Weights & Biases, or similar for data and model tracking and versioning
Solid understanding of security best practices for machine learning systems and sensitive data handling
Strong scripting and programming skills in Python, Bash, or Go

Preferred

Experience with data orchestration tools like DataChain, Weights & Biases, etc., for managing ML workflows
Hands-on experience with automated hyperparameter tuning and optimization frameworks
Familiarity with model monitoring tools like Prometheus, Grafana, or custom solutions for model drift and data quality checks
Experience integrating pre-trained foundation models and managing their deployment at scale
Contributions to open-source ML Ops projects or relevant research publications

Benefits

A bonus and/or long-term incentive units may be provided as part of the compensation package, in addition to the full range of medical, financial, and/or other benefits, dependent on the level and position offered.

Company

The Walt Disney Company

The Walt Disney Company started as a cartoon studio and has since expanded into sports coverage and television shows.

H1B Sponsorship

The Walt Disney Company has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for your reference. (Data powered by US Department of Labor)
Trends of Total Sponsorships
2025 (83)
2024 (63)
2023 (96)
2022 (130)
2021 (30)
2020 (40)

Funding

Current Stage
Public Company
Total Funding
$11B
Key Investors
Citibank
2020-04-13 · Post-IPO Debt · $5B
2020-03-20 · Post-IPO Debt · $6B
1978-01-06 · IPO

Leadership Team

Robert Iger
CEO
Christine M. McCarthy
Senior Executive Vice President & Chief Financial Officer
Company data provided by Crunchbase