This job has closed.

NVIDIA · 3 days ago

Senior HPC Performance Engineer

Santa Clara, CA

Full-time

Onsite

Mid, Senior Level

$148K/yr - $288K/yr

3+ years exp

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. They are seeking a motivated Performance Engineer to influence the roadmap of their communication libraries, focusing on performance characterization and analysis on large multi-GPU and multi-node clusters.

AI InfrastructureArtificial Intelligence (AI)Consumer ElectronicsFoundational AIGPUHardwareSoftwareVirtual Reality

Growth Opportunities

H1B Sponsor Likely

Responsibilities

Conduct in-depth performance characterization and analysis on large multi-GPU and multi-node clusters

Study the interaction of our libraries with all HW (GPU, CPU, Networking) and SW components in the stack

Evaluate proof-of-concepts, conduct trade-off analysis when multiple solutions are available

Triage and root-cause performance issues reported by our customers

Collect a lot of performance data; build tools and infrastructure to visualize and analyze the information

Collaborate with a very dynamic team across multiple time zones

Qualification

HPC experienceParallel programmingPerformance benchmarkingC/C++ programmingPythonComputer architectureNetworking knowledgeContainersCloud toolsAdaptabilityTeam collaboration

Required

M.S. (or equivalent experience) or PHD in Computer Science, or related field with relevant performance engineering and HPC experience

3+ yrs of experience with parallel programming and at least one communication runtime (MPI, NCCL, UCX, NVSHMEM)

Experience conducting performance benchmarking and triage on large scale HPC clusters

Good understanding of computer system architecture, HW-SW interactions and operating systems principles (aka systems software fundamentals)

Implement micro-benchmarks in C/C++, read and modify the code base when required

Ability to debug performance issues across the entire HW/SW stack. Proficient in a scripting language, preferably Python

Familiar with containers, cloud provisioning and scheduling tools (Kubernetes, SLURM, Ansible, Docker)

Adaptability and passion to learn new areas and tools. Flexibility to work and communicate effectively across different teams and timezones

Preferred

Practical experience with Infiniband/Ethernet networks in areas like RDMA, topologies, congestion control

Experience debugging network issues in large scale deployments

Familiarity with CUDA programming and/or GPUs

Experience with Deep Learning Frameworks such PyTorch, TensorFlow

Benefits

Equity

Benefits

Company

NVIDIA

Glassdoor4.6

NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.

Founded in 1993

Santa Clara, California, USA

10001+ employees

https://www.nvidia.com

H1B Sponsorship

NVIDIA has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)

Distribution of Different Job Fields Receiving Sponsorship

Represents job field similar to this job

Trends of Total Sponsorships

2025 (1877)

2024 (1355)

2023 (976)

2022 (835)

2021 (601)

2020 (529)

Funding

Current Stage

Public Company

Total Funding

$4.09B

Key Investors

ARPA-EARK Investment ManagementSoftBank Vision Fund

2023-05-09Grant· $5M

2022-08-09Post Ipo Equity· $65M

2021-02-18Post Ipo Equity

Leadership Team

Jensen Huang

Founder and CEO

Michael Kagan

Chief Technology Officer

Recent News

SiliconANGLE

Red Hat pledges day-zero support for Nvidia’s newest GPUs

2026-01-11

PitchBook

Map: The VCs benefiting from Nvidia’s M&A

2026-01-11

Digital News Asia

Republic Polytechnic accelerates AI transformation to develop future-ready learners and an AI-adept workforce

2026-01-11

Company data provided by crunchbase