NVIDIA · 1 day ago
Senior Generative AI Research Engineer
NVIDIA is a leading technology company that is at the forefront of AI computing. They are seeking a Senior Generative AI Research Engineer to design and post-train foundation models for real-world applications, collaborate on large-scale training infrastructure, and mentor junior engineers in the field of generative AI.
AI InfrastructureArtificial Intelligence (AI)Consumer ElectronicsFoundational AIGPUHardwareSoftwareVirtual Reality
Responsibilities
Design and post-train foundation models (LLMs, VLMs, VLAs and DiTs) for real world applications
Contribute to highly-collaborative development on large-scale training infrastructure, high-efficiency inference pipelines, and scalable data pipelines
Work with teams in research, software, and product to bring world models from idea to deployment
Collaborate on open-source and internal projects, author technical papers or patents, and mentor junior engineers
Prototype and iterate rapidly on experiments across cutting-edge AI domains, including agentic systems, reinforcement learning, reasoning, and video generation
Design and implement model distillation algorithms for size reduction and diffusion step optimization
Profile and benchmark training and inference pipelines to achieve production-ready performance requirements
Qualification
Required
Stellar experience building and deploying generative AI systems (minimum 8 years industry or 5+ years research/postdoc)
Proficiency in PyTorch, JAX, or other deep learning frameworks is a must!
Expertise in one or more of: LLMs, coding agents, diffusion models, autoregressive models, VAE/GAN architectures, retrieval-augmented generation, neural rendering, or multi-agent systems
Intimately familiar with all variants of the attention mechanisms
Hands on experience with large scale training (e.g., ZeRO, DDP, FSDP, TP, CP) and data processing (e.g. Ray, Spark)
Production-quality software engineering skills in Python
MS or PhD or equivalent experience in Computer Science, Machine Learning, Applied Math, Physics, or a related field
12+ years of relevant software development experience
Preferred
Familiarity with high-performance computing and GPU acceleration
Contributions to influential open-source libraries or influential conference publications (NeurIPS, ICML, CVPR, ICLR)
Experience working with multimodal data (e.g., vision-language, VLA, audio)
Prior work with NVIDIA GPU-based compute clusters or simulation environments
Benefits
Equity
Benefits
Company
NVIDIA
NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.
H1B Sponsorship
NVIDIA has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1877)
2024 (1355)
2023 (976)
2022 (835)
2021 (601)
2020 (529)
Funding
Current Stage
Public CompanyTotal Funding
$4.09BKey Investors
ARPA-EARK Investment ManagementSoftBank Vision Fund
2023-05-09Grant· $5M
2022-08-09Post Ipo Equity· $65M
2021-02-18Post Ipo Equity
Recent News
Tech Startups - Tech News, Tech Trends & Startup Funding
2026-01-22
Dynamic Business
2026-01-22
2026-01-22
Company data provided by crunchbase