Engineering Manager, Machine Learning, Model Evaluations and Data Curation (AI Foundations) jobs in United States
cer-icon
Apply on Employer Site
company-logo

Netflix · 8 hours ago

Engineering Manager, Machine Learning, Model Evaluations and Data Curation (AI Foundations)

Netflix is one of the world's leading entertainment services, with over 300 million paid memberships globally. They are seeking an Engineering Manager to lead a team focused on model evaluations and data curation for large language models, driving innovation in personalization and discovery.

Digital EntertainmentMedia and EntertainmentTVVideo Streaming
check
Comp. & Benefits
check
H1B Sponsor Likelynote

Responsibilities

Partner with downstream AI application teams to define shared evaluations that codify application expectations of LLMs and other foundation models, ensuring progress can be transparently tracked against real-world needs
Design rigorous benchmarks and evaluation methodologies across ranking & recommendations, content understanding, and language/text generation — grounded in a deep technical understanding of LLMs, their strengths, limitations, and failure modes
Lead the development of evaluators and strong baselines to ensure in-house LLMs and other foundation models demonstrate clear advantages over off-the-shelf alternatives
Build scalable, reproducible data and evaluation systems that make dataset creation and evaluation design as nimble and experiment-friendly as model development itself
Hire, grow, and nurture a world-class team, fostering an inclusive, high-performing culture that balances research innovation with engineering excellence
Work closely with the teams developing Netflix’s foundation models (including our core LLM) to ensure evaluation and data insights are folded back into the cadence of model development. Proactively influence the ML Platform and Data Engineering teams at key interfaces

Qualification

Machine LearningLarge Language ModelsEvaluation MethodologiesData InfrastructureBenchmark DesignTechnical ExpertiseTeam LeadershipCross-functional CollaborationCommunication Skills

Required

Experience building and leading high-performing teams of ML researchers and engineers
Proven track record of leading machine learning initiatives from research to production, ideally involving evaluation frameworks, ML infrastructure, or data-intensive systems
Strong technical expertise in LLMs, their evaluation, and practical methods for ensuring robustness, reproducibility, and quality
Broad knowledge of machine learning fundamentals and evaluation methodologies, including benchmark design, model-based evaluators, and offline/online metrics
Experience driving cross-functional projects, including close collaboration with AI application teams to translate product needs into evaluation frameworks
Excellent written and verbal communication skills, able to bridge technical and non-technical audiences
Advanced degree in Computer Science, Statistics, or a related quantitative field

Preferred

8+ years of overall experience, including 3+ years in engineering management
Experience with large-scale ML systems and foundation models, especially LLMs
Background in building evaluation frameworks, model benchmarking, or data infrastructure for LLM training
Familiarity with multi-modal data and evaluation

Benefits

Health Plans
Mental Health support
A 401(k) Retirement Plan with employer match
Stock Option Program
Disability Programs
Health Savings and Flexible Spending Accounts
Family-forming benefits
Life and Serious Injury Benefits
Paid leave of absence programs
Full-time hourly employees accrue 35 days annually for paid time off to be used for vacation, holidays, and sick paid time off.
Full-time salaried employees are immediately entitled to flexible time off.

Company

Netflix is an online streaming platform that enables users to watch TV shows and movies.

H1B Sponsorship

Netflix has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (230)
2024 (309)
2023 (191)
2022 (261)
2021 (268)
2020 (225)

Funding

Current Stage
Public Company
Total Funding
$63.91B
Key Investors
Wells FargoTCVGroupe Arnault
2025-12-05Post Ipo Debt· $59B
2024-08-01Post Ipo Debt· $1.8B
2018-05-05Post Ipo Debt· $2.67M

Leadership Team

leader-logo
Gregory Peters
Co-CEO
linkedin
leader-logo
Ted Sarandos
Co-CEO
linkedin
Company data provided by crunchbase