Research Scientist (post-training) jobs in United States
cer-icon
Apply on Employer Site
company-logo

Genmo · 2 months ago

Research Scientist (post-training)

Genmo is a research lab dedicated to building state-of-the-art models for video generation. They are seeking an exceptional Research Scientist to focus on alignment and post-training techniques for large-scale video generation models, ensuring high-quality and safe outputs that align with human preferences.

Artificial Intelligence (AI)ContentDigital Entertainment
check
H1B Sponsor Likelynote

Responsibilities

Lead research initiatives in alignment and post-training methods for video generation models, focusing on improved quality, reliability, and adherence to human intent
Design and implement supervised fine-tuning and reinforcement learning from human feedback (RLHF) pipelines for video generation models
Develop robust evaluation frameworks to measure model alignment, safety, and output quality
Create and optimize data collection pipelines for human feedback and preferences
Design and conduct experiments to validate alignment techniques and their scaling properties
Collaborate with cross-functional teams to integrate alignment improvements into our production pipeline
Stay at the cutting edge of the field by regularly reviewing academic literature in both generative AI and alignment
Mentor junior researchers and foster a culture of responsible AI development
Work closely with product teams to ensure alignment methods enhance rather than inhibit model capabilities

Qualification

Ph.D. in AIReinforcement learningPyTorchLarge-scale trainingEvaluation frameworksDiffusion modelsHuman feedback dataPerceptual quality metricsSoftware engineeringCollaboration with product teamsOpen-source contributionsCommunication skills

Required

Ph.D. in Computer Science, Artificial Intelligence, Machine Learning, or a closely related field
Strong publication record in top-tier conferences (e.g., NeurIPS, ICML, ICLR) with a focus on reinforcement learning, alignment, or generative models
Extensive experience implementing and optimizing large-scale training pipelines using PyTorch
Deep understanding of reinforcement learning techniques, particularly RLHF
Experience with distributed training systems and large-scale experiments
Proven track record in designing and implementing robust evaluation frameworks
Excellent communication skills with the ability to explain complex technical concepts to diverse audiences
Strong software engineering skills and experience with complex shared codebases

Preferred

Experience with diffusion models or other generative architectures
Background in fine-tuning large language models or generative models
Experience working with human feedback data collection and annotation pipelines
Strong aesthetic sense and understanding of video quality assessment
Familiarity with alignment techniques such as constitutional AI or debate
Track record of successful collaboration with product teams
Experience with perceptual quality metrics and human evaluation design
Contributions to open-source projects in AI alignment or generative AI

Company

Genmo

twittertwittertwitter
company-logo
Genmo is an artificial intelligence creative content generation platform that specializes in developing creative products.

H1B Sponsorship

Genmo has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)

Funding

Current Stage
Early Stage
Total Funding
$58.4M
Key Investors
New Enterprise Associates
2024-10-22Series A· $28.4M
2024-02-27Series Unknown· $30M

Leadership Team

leader-logo
Ajay Jain
Co-Founder and CTO
linkedin
Company data provided by crunchbase