Research Scientist (diffusion) jobs in United States

Genmo · 2 months ago

Research Scientist (diffusion)

Genmo is a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI. They are seeking an exceptional Research Scientist to develop cutting-edge diffusion models for text-to-video generation, focusing on creating novel architectures and algorithms.

Artificial Intelligence (AI) · Content · Digital Entertainment
H1B Sponsor Likely

Responsibilities

Lead research initiatives in advanced diffusion models for text-to-video generation, focusing on improving visual quality, temporal consistency, and semantic fidelity
Develop and implement state-of-the-art algorithms for translating textual descriptions into dynamic video content
Design and conduct rigorous experiments to validate new ideas and evaluate model performance
Collaborate with cross-functional teams to integrate research breakthroughs into our production pipeline
Stay at the cutting edge of the field by regularly reviewing academic literature and attending top-tier conferences
Contribute to the research community through high-quality publications and open-source contributions
Mentor junior researchers and foster a culture of innovation within the research team
Work closely with product teams to align research directions with user needs and market opportunities

Qualifications

Ph.D. in AI/ML · Publication record · Generative models · Python · Deep learning frameworks · Text-to-video generation · Collaboration skills · Open-source contributions · Communication skills · Mentoring abilities

Required

Ph.D. in Computer Science, Artificial Intelligence, Machine Learning, or a closely related field
Strong publication record in top-tier conferences (e.g., CVPR, ICCV, NeurIPS, ICML) with a focus on generative models, particularly diffusion models
Extensive experience implementing and optimizing large-scale generative models for image or video tasks
Deep understanding of state-of-the-art techniques in text-to-image and text-to-video generation
Proficiency in Python and deep learning frameworks such as PyTorch or TensorFlow
Excellent communication skills with the ability to explain complex technical concepts to diverse audiences
Proven ability to work collaboratively in a team environment

Preferred

Postdoctoral or industrial research experience in generative AI for video
Hands-on experience with text-to-video generation projects
Expertise in other generative model architectures (e.g., GANs, VAEs) and their applications to video
Experience working with large-scale datasets and distributed computing environments
Track record of successful collaboration with product teams on technology transfers
Familiarity with video codecs, compression techniques, and perceptual quality metrics
Contributions to open-source projects in the field of generative AI

Company

Genmo

Genmo is an AI-powered content generation platform that specializes in developing creative products.

H1B Sponsorship

Genmo has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
[Chart: Distribution of Different Job Fields Receiving Sponsorship, with job fields similar to this job highlighted]
Trends of Total Sponsorships: 2025 (1)

Funding

Current Stage: Early Stage
Total Funding: $58.4M
Key Investors: New Enterprise Associates
2024-10-22 · Series A · $28.4M
2024-02-27 · Series Unknown · $30M

Leadership Team

Ajay Jain
Co-Founder and CTO
Company data provided by Crunchbase