disney · 3 hours ago
Sr Data Scientist
Disney is seeking a Sr Data Scientist to join their R&D teams at Lucasfilm and ILM, focusing on Generative AI. The role involves developing a robust data curation pipeline and ensuring the quality of training datasets for advanced machine learning applications.
Audio Recording and ProductionMusic
Responsibilities
Independently design and implement statistical methods to ensure curated datasets retain representative coverage across various visual attributes, stylistic choices, and subject matter
Develop logic to identify and down-weight low-variance or repetitive data points to maximize training efficiency
Collaborate with key stakeholders on algorithms for de-duplication to automatically eliminate redundant or near-identical assets from the training corpus
Design and lead implementation of automated metrics to assess the quality of generative images and videos
Validate automated quantitative metrics by correlating them against qualitative feedback provided by senior creative stakeholders
Establish success criteria for model fidelity, accuracy, and stylistic consistency
Work closely with the engineering team to integrate data cleaning, normalization, and sampling modules into a scalable automated pipeline
Assist in defining taxonomy and metadata standards to systematically organize unstructured visual assets
Phase 1: defining data taxonomy and establishing baseline automated metrics
Phase 2: refining metrics for temporal consistency and validating against initial model fine-tuning runs
Phase 3: final validation of metrics and delivery of fully curated, optimized datasets for cold storage
Qualification
Required
5+ years experience in related field
Education - Bachelor's degree in Data Science, Computer Science, or a related field of study, and/or equivalent work experience
Proven background in Data Science with a strong emphasis on Computer Vision, Generative AI, or Deep Learning
Proficiency in statistical analysis and dataset curation (distribution analysis, sampling techniques)
Ability to translate complex statistical insights for engineering partners and non-technical creative leads
Preferred
Master's Degree preferred
Experience working with large-scale unstructured media data is a plus
Familiarity with standard and novel metrics for evaluating Generative Models (e.g., FID, FVD, or similar)
Benefits
A bonus and/or long-term incentive units may be provided as part of the compensation package
The full range of medical, financial, and/or other benefits
Company
disney
disney.com
H1B Sponsorship
disney has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (83)
2024 (63)
2023 (96)
2022 (130)
2021 (30)
2020 (40)
Funding
Current Stage
Early StageCompany data provided by crunchbase