The Walt Disney Company · 3 weeks ago
Sr ML Ops Engineer
The Walt Disney Company is seeking a highly skilled Sr ML Ops Engineer to build and maintain the infrastructure powering its machine learning and AI frameworks. This role is crucial for enabling seamless workflows for model training, retraining, and deployment, ensuring that AI solutions operate reliably at scale.
Amusement Park and Arcade · Animation · Consumer Goods · Digital Media · E-Commerce · Media and Entertainment · Multi-level Marketing · Performing Arts · Resorts
Responsibilities
Develop, deploy, and maintain scalable infrastructure for machine learning model training, retraining, and inference
Design and optimize CI/CD pipelines specifically tailored for machine learning workflows, ensuring efficient delivery from research to production
Implement robust monitoring and logging systems to track model performance and identify potential issues in production environments
Collaborate with AI researchers and data scientists to ensure infrastructure aligns with project requirements and supports iterative experimentation
Manage compute resources (cloud and on-premises) to enable large-scale distributed training and inference tasks
Containerize machine learning models and applications using Docker and deploy them via Kubernetes or equivalent orchestration systems
Automate deployment workflows for serving ML models using frameworks such as TorchServe, TensorFlow Serving, and FastAPI
Implement model versioning, rollback strategies, and governance for maintaining production stability
Optimize cost efficiency and performance of machine learning workflows in cloud environments such as AWS, GCP, or Azure
Stay updated with emerging ML Ops tools and practices, integrating them into existing workflows to improve performance and reliability
Qualifications
Required
Bachelor's degree in Computer Science, Engineering, or a related field
5+ years of experience in DevOps, Site Reliability Engineering, or a related role, with at least 2 years focused on ML Ops
Expertise in building and maintaining CI/CD pipelines for machine learning applications
Strong proficiency with containerization (Docker) and orchestration tools (Kubernetes)
Proficiency in deploying machine learning models using frameworks such as TensorFlow Serving, TorchServe, or custom APIs
Deep understanding of cloud infrastructure and services (AWS, GCP, or Azure) for ML workloads, including GPU and TPU utilization
Experience managing large-scale distributed training workflows and optimizing resource allocation
Familiarity with tools like MLflow, DVC, Weights & Biases, or similar for data and model tracking and versioning
Solid understanding of security best practices for machine learning systems and sensitive data handling
Strong scripting and programming skills in Python, Bash, or Go
Preferred
Experience with data orchestration tools like DataChain, Weights & Biases, etc., for managing ML workflows
Hands-on experience with automated hyperparameter tuning and optimization frameworks
Familiarity with model monitoring tools like Prometheus, Grafana, or custom solutions for model drift and data quality checks
Experience integrating pre-trained foundation models and managing their deployment at scale
Contributions to open-source ML Ops projects or relevant research publications
Benefits
A bonus and/or long-term incentive units may be provided as part of the compensation package
Full range of medical, financial, and/or other benefits
Company
The Walt Disney Company
The Walt Disney Company started as a cartoon studio and has since expanded into sports coverage and television shows.
H1B Sponsorship
The Walt Disney Company has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for your reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2025 (83)
2024 (63)
2023 (96)
2022 (130)
2021 (30)
2020 (40)
Funding
Current Stage: Public Company
Total Funding: $11B
Key Investors: Citibank
2020-04-13 · Post-IPO Debt · $5B
2020-03-20 · Post-IPO Debt · $6B
1978-01-06 · IPO
Company data provided by Crunchbase