Senior Software Engineer - NIM Factory Infrastructure jobs in United States
cer-icon
Apply on Employer Site
company-logo

NVIDIA · 1 day ago

Senior Software Engineer - NIM Factory Infrastructure

NVIDIA is the platform upon which every new AI-powered application is built. We are seeking a senior engineer to design and build factory automation for NVIDIA Inference Microservices (NIMs) to optimize and serve performant inferencing for AI models. The role involves developing infrastructure, collaborating with teams, and driving technical advances in workflows for AI model deployment.

Artificial Intelligence (AI)Consumer ElectronicsGPUHardwareSoftwareVirtual Reality
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Develop, analyze and optimize factory infrastructure that will take an AI model in and produce a deployable service that is validated across Cloud, On-prem and Kubernetes environments
With the team, define and deliver rapid iterations on the group's technical strategies and roadmaps to deliver and improve the NIM factory
You will be developing harness, automating hardware acceptance, analyze benchmarks, data gathering and statistical analysis of systems health and performance analysis of NIMs
Work with technical leaders designing and developing scalable and reliable factory acceptance and performance tuning of hardware platforms
You will collaborate with multiple AI model teams to understand their requirements to build an efficient infrastructure that improves every team's productivity
Define metrics and drive improvements based on user feedback
You will mentor and collaborate throughout the team and with other teams to grow your colleagues and yourself

Qualification

Advanced programming skillsSystem software expertisePerformance analysisCloud software developmentHardware system characterizationDistributed systemsNetworking expertiseGPU interconnectsInterpersonal skillsMentoringCollaborationProblem-solving

Required

A history of using your advanced programming skills to build tooling and automation for hardware system characterization and benchmarking
Proven experience debugging and analyzing performance of compute applications and system
Deep technical expertise working with system software and platform layers including Kernel, device driver, memory, storage, networking and PCIe devices
Experience working with hardware clusters, distributed system, networking, GPU interconnects (PCie, NVlink), node and cluster interconnect (InfiniBand)
Passion for building platform engineering components and automation of system benchmarking and characterization
Excellent interpersonal skills and the ability to lead multi-functional efforts
BS or MS in Computer Science, Computer Engineering or related field (or equivalent experience)
5+ years of shown experience developing performant microservice, cloud software and/or tooling roles

Preferred

Experience delivering optimized system engineering environment for inference applications in data center and consumer grade hardware platforms
A history of building and deploying automated benchmarking solution in Cloud and On-prem environments, and their associated CI/CD pipelines
Prior experience in working with large scale compute infrastructure solution

Benefits

Equity and benefits

Company

NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.

H1B Sponsorship

NVIDIA has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1877)
2024 (1355)
2023 (976)
2022 (835)
2021 (601)
2020 (529)

Funding

Current Stage
Public Company
Total Funding
$4.09B
Key Investors
ARPA-EARK Investment ManagementSoftBank Vision Fund
2023-05-09Grant· $5M
2022-08-09Post Ipo Equity· $65M
2021-02-18Post Ipo Equity

Leadership Team

leader-logo
Jensen Huang
Founder and CEO
linkedin
leader-logo
Michael Kagan
Chief Technology Officer
linkedin
Company data provided by crunchbase