Senior Generative AI Research Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

NVIDIA · 2 days ago

Senior Generative AI Research Engineer

NVIDIA is a leading technology company specializing in AI computing and generative modeling. They are seeking a Senior Generative AI Research Engineer to design and post-train foundation models and contribute to the development of large-scale training infrastructure and inference pipelines.

AI InfrastructureArtificial Intelligence (AI)Consumer ElectronicsFoundational AIGPUHardwareSoftwareVirtual Reality
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Design and post-train foundation models (LLMs, VLMs, VLAs and DiTs) for real world applications
Contribute to highly-collaborative development on large-scale training infrastructure, high-efficiency inference pipelines, and scalable data pipelines
Work with teams in research, software, and product to bring world models from idea to deployment
Collaborate on open-source and internal projects, author technical papers or patents, and mentor junior engineers
Prototype and iterate rapidly on experiments across cutting-edge AI domains, including agentic systems, reinforcement learning, reasoning, and video generation
Design and implement model distillation algorithms for size reduction and diffusion step optimization
Profile and benchmark training and inference pipelines to achieve production-ready performance requirements

Qualification

Generative AI systemsDeep learning frameworksTransformer architecturesLarge scale trainingPythonModel distillation algorithmsMultimodal dataTechnical writingMentoringCollaboration

Required

Minimum 8 years industry or 5+ years research/postdoc experience building and deploying generative AI systems
Proficiency in PyTorch, JAX, or other deep learning frameworks
Expertise in one or more of: LLMs, coding agents, diffusion models, autoregressive models, VAE/GAN architectures, retrieval-augmented generation, neural rendering, or multi-agent systems
Intimately familiar with all variants of the attention mechanisms in transformer architectures
Hands on experience with large scale training (e.g., ZeRO, DDP, FSDP, TP, CP) and data processing (e.g. Ray, Spark)
Production-quality software engineering skills in Python
MS or PhD or equivalent experience in Computer Science, Machine Learning, Applied Math, Physics, or a related field
12+ years of relevant software development experience

Preferred

Familiarity with high-performance computing and GPU acceleration
Contributions to influential open-source libraries or influential conference publications (NeurIPS, ICML, CVPR, ICLR)
Experience working with multimodal data (e.g., vision-language, VLA, audio)
Prior work with NVIDIA GPU-based compute clusters or simulation environments

Benefits

Equity
Benefits

Company

NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.

H1B Sponsorship

NVIDIA has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1877)
2024 (1355)
2023 (976)
2022 (835)
2021 (601)
2020 (529)

Funding

Current Stage
Public Company
Total Funding
$4.09B
Key Investors
ARPA-EARK Investment ManagementSoftBank Vision Fund
2023-05-09Grant· $5M
2022-08-09Post Ipo Equity· $65M
2021-02-18Post Ipo Equity

Leadership Team

leader-logo
Jensen Huang
Founder and CEO
linkedin
leader-logo
Michael Kagan
Chief Technology Officer
linkedin
Company data provided by crunchbase