Senior Generative AI Research Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

NVIDIA · 1 day ago

Senior Generative AI Research Engineer

NVIDIA is a leading technology company that is at the forefront of AI computing. They are seeking a Senior Generative AI Research Engineer to design and post-train foundation models for real-world applications, collaborate on large-scale training infrastructure, and mentor junior engineers in the field of generative AI.

AI InfrastructureArtificial Intelligence (AI)Consumer ElectronicsFoundational AIGPUHardwareSoftwareVirtual Reality
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Design and post-train foundation models (LLMs, VLMs, VLAs and DiTs) for real world applications
Contribute to highly-collaborative development on large-scale training infrastructure, high-efficiency inference pipelines, and scalable data pipelines
Work with teams in research, software, and product to bring world models from idea to deployment
Collaborate on open-source and internal projects, author technical papers or patents, and mentor junior engineers
Prototype and iterate rapidly on experiments across cutting-edge AI domains, including agentic systems, reinforcement learning, reasoning, and video generation
Design and implement model distillation algorithms for size reduction and diffusion step optimization
Profile and benchmark training and inference pipelines to achieve production-ready performance requirements

Qualification

Generative AI systemsPyTorchDeep learning frameworksTransformer architecturesLarge scale trainingData processingProduction-quality software engineeringComputer ScienceHigh-performance computingMultimodal dataNVIDIA GPU-based compute

Required

Stellar experience building and deploying generative AI systems (minimum 8 years industry or 5+ years research/postdoc)
Proficiency in PyTorch, JAX, or other deep learning frameworks is a must!
Expertise in one or more of: LLMs, coding agents, diffusion models, autoregressive models, VAE/GAN architectures, retrieval-augmented generation, neural rendering, or multi-agent systems
Intimately familiar with all variants of the attention mechanisms
Hands on experience with large scale training (e.g., ZeRO, DDP, FSDP, TP, CP) and data processing (e.g. Ray, Spark)
Production-quality software engineering skills in Python
MS or PhD or equivalent experience in Computer Science, Machine Learning, Applied Math, Physics, or a related field
12+ years of relevant software development experience

Preferred

Familiarity with high-performance computing and GPU acceleration
Contributions to influential open-source libraries or influential conference publications (NeurIPS, ICML, CVPR, ICLR)
Experience working with multimodal data (e.g., vision-language, VLA, audio)
Prior work with NVIDIA GPU-based compute clusters or simulation environments

Benefits

Equity
Benefits

Company

NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.

H1B Sponsorship

NVIDIA has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1877)
2024 (1355)
2023 (976)
2022 (835)
2021 (601)
2020 (529)

Funding

Current Stage
Public Company
Total Funding
$4.09B
Key Investors
ARPA-EARK Investment ManagementSoftBank Vision Fund
2023-05-09Grant· $5M
2022-08-09Post Ipo Equity· $65M
2021-02-18Post Ipo Equity

Leadership Team

leader-logo
Jensen Huang
Founder and CEO
linkedin
leader-logo
Michael Kagan
Chief Technology Officer
linkedin
Company data provided by crunchbase