NVIDIA · 1 day ago
Senior Software Engineer - NIM Factory Infrastructure
NVIDIA is the platform upon which every new AI-powered application is built. We are seeking a senior engineer to design and build factory automation for NVIDIA Inference Microservices (NIMs) to optimize and serve performant inferencing for AI models. The role involves developing infrastructure, collaborating with teams, and driving technical advances in workflows for AI model deployment.
Artificial Intelligence (AI)Consumer ElectronicsGPUHardwareSoftwareVirtual Reality
Responsibilities
Develop, analyze and optimize factory infrastructure that will take an AI model in and produce a deployable service that is validated across Cloud, On-prem and Kubernetes environments
With the team, define and deliver rapid iterations on the group's technical strategies and roadmaps to deliver and improve the NIM factory
You will be developing harness, automating hardware acceptance, analyze benchmarks, data gathering and statistical analysis of systems health and performance analysis of NIMs
Work with technical leaders designing and developing scalable and reliable factory acceptance and performance tuning of hardware platforms
You will collaborate with multiple AI model teams to understand their requirements to build an efficient infrastructure that improves every team's productivity
Define metrics and drive improvements based on user feedback
You will mentor and collaborate throughout the team and with other teams to grow your colleagues and yourself
Qualification
Required
A history of using your advanced programming skills to build tooling and automation for hardware system characterization and benchmarking
Proven experience debugging and analyzing performance of compute applications and system
Deep technical expertise working with system software and platform layers including Kernel, device driver, memory, storage, networking and PCIe devices
Experience working with hardware clusters, distributed system, networking, GPU interconnects (PCie, NVlink), node and cluster interconnect (InfiniBand)
Passion for building platform engineering components and automation of system benchmarking and characterization
Excellent interpersonal skills and the ability to lead multi-functional efforts
BS or MS in Computer Science, Computer Engineering or related field (or equivalent experience)
5+ years of shown experience developing performant microservice, cloud software and/or tooling roles
Preferred
Experience delivering optimized system engineering environment for inference applications in data center and consumer grade hardware platforms
A history of building and deploying automated benchmarking solution in Cloud and On-prem environments, and their associated CI/CD pipelines
Prior experience in working with large scale compute infrastructure solution
Benefits
Equity and benefits
Company
NVIDIA
NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.
H1B Sponsorship
NVIDIA has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1877)
2024 (1355)
2023 (976)
2022 (835)
2021 (601)
2020 (529)
Funding
Current Stage
Public CompanyTotal Funding
$4.09BKey Investors
ARPA-EARK Investment ManagementSoftBank Vision Fund
2023-05-09Grant· $5M
2022-08-09Post Ipo Equity· $65M
2021-02-18Post Ipo Equity
Recent News
2026-01-08
Company data provided by crunchbase