Senior Software Engineer - AI Research Clusters jobs in United States
info-icon
This job has closed.
company-logo

NVIDIA · 1 week ago

Senior Software Engineer - AI Research Clusters

NVIDIA is at the forefront of innovations in Artificial Intelligence, High-Performance Computing, and Visualization. They are seeking a Senior Software Engineer to help accelerate the next era of machine learning innovation by proposing and implementing engineering solutions for GPU clusters.

AI InfrastructureArtificial Intelligence (AI)Consumer ElectronicsFoundational AIGPUHardwareSoftwareVirtual Reality
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

In this position, you will work with coworkers across the AI Platform organization to understand the pain points of validating, monitoring and operating GPU clusters at scale. Then you will design, develop and maintain engineering solutions to solve those pain points systematically
You will also research in traditional AIOps and the emerging Agentic AI, and leverage it to further reduce the operation toil
You will participate in on-call support for systems, platforms built and owned by the team

Qualification

ML infrastructureDistributed systemsPythonC++RustDockerKubernetesAIOpsAgentic AIFull-stack developmentRelational Data ModelingDB optimizationREST API SemanticsJavascriptCSSSlurmGPU computingLinux systems internalsPerformance tuning

Required

BS/MS in Computer Science, Engineering, or equivalent experience
8+ years in software/platform engineering, including 3+ years in ML infrastructure or distributed systems
Experience in software development lifecycle on Linux-based platforms
Strong coding skills in languages such as Python, C++ or Rust
Experience with Docker, Kubernetes, GitLab CI, automated deployments
Experience with AIOps or Agentic AI and apply it successfully in production environment

Preferred

Proficiency with full-stack development: Relational Data Modeling, DB optimization, REST API Semantics, Javascript, CSS, providing API as a service
Passion for building developer-centric platforms with great UX and strong operational reliability
Experience running Slurm or custom scheduling frameworks in production ML environments
Familiarity with GPU computing, Linux systems internals, and performance tuning at scale

Benefits

Equity
Benefits

Company

NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.

H1B Sponsorship

NVIDIA has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1877)
2024 (1355)
2023 (976)
2022 (835)
2021 (601)
2020 (529)

Funding

Current Stage
Public Company
Total Funding
$4.09B
Key Investors
ARPA-EARK Investment ManagementSoftBank Vision Fund
2023-05-09Grant· $5M
2022-08-09Post Ipo Equity· $65M
2021-02-18Post Ipo Equity

Leadership Team

leader-logo
Jensen Huang
Founder and CEO
linkedin
leader-logo
Michael Kagan
Chief Technology Officer
linkedin
Company data provided by crunchbase