AI Research Scientist, Media Data Research - MSL FAIR jobs in United States

Meta · 2 days ago

AI Research Scientist, Media Data Research - MSL FAIR

Meta is seeking AI research scientists to help build the data foundation for its advanced Large Language and Media Models. The role involves collaborating with teams to develop foundational models, improving data workflows, and leading technical projects in data curation.

Computer Software
Comp. & Benefits

Responsibilities

Collaborate with cross-functional teams to develop Meta’s next foundational models
Advance our understanding of data research, such as how to overcome data walls and how best to create synthetic data
Fundamentally improve our data velocity across workflows and projects by contributing to the advancement of data tooling
Execute on high-priority projects in pre-training, mid-training, or post-training data curation
Apply specialized expertise in video/image generation, video/image perception, OCR, data scaling laws, or data mixing
Lead complex technical projects end-to-end

Qualifications

LLM/NLP expertise
Computer vision
Multimodal pre-training
Python programming
PyTorch
Data curation
Published research
SQL familiarity
Technical leadership

Required

Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
PhD in Computer Science or a related technical field
2+ years of industry research experience in LLM/NLP, computer vision, or related AI/ML models
Experience as a formal technical lead, leading major technical initiatives with cross-functional impact, and/or influencing strategy across multiple teams
Practical experience with multimodal pre-training or mid-training data curation for large media perception or generation models
Published research in leading peer-reviewed conferences (e.g., ACL, NeurIPS, ICML, ICLR, AAAI, KDD, CVPR, ICCV) and/or demonstrated significant industry influence in the field of AI

Preferred

Experience working on frontier-quality/state-of-the-art Large Language or Large Media Models
First-author publications at top peer-reviewed conferences (e.g., ACL, NeurIPS, ICML, ICLR, AAAI, KDD, CVPR, ICCV)
Programming experience in Python and hands-on experience with frameworks like PyTorch or Spark, or related distributed computing frameworks (Ray, DataFlow)
Familiarity with SQL and with table and file formats such as Hive, Iceberg, and Parquet

Benefits

Bonus
Equity
Benefits

Company

Meta's mission is to build the future of human connection and the technology that makes it possible.

Funding

Current Stage
Late Stage

Leadership Team

Kathryn Glickman
Director, CEO Communications
Christine Lu
CTO Business Engineering NA
Company data provided by Crunchbase