Data Center GPU Performance Engineer – Product jobs in United States
cer-icon
Apply on Employer Site
company-logo

NVIDIA · 4 months ago

Data Center GPU Performance Engineer – Product

NVIDIA has been redefining computer graphics, PC gaming, and accelerated computing for more than 25 years. The Data Center GPU Performance Engineer will guide GPU architecture for datacenter applications and drive innovation opportunities in GPU and data-center architecture.

AI InfrastructureArtificial Intelligence (AI)Consumer ElectronicsFoundational AIGPUHardwareSoftwareVirtual Reality
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Guide the architecture of the next-generation of GPUs through an intuitive and comprehensive grasp of how GPU architecture affects performance for datacenter applications, especially Large Language Models (LLMs)
Drive the discovery of opportunities for innovation in GPU, system, and data-center architecture by analyzing the latest data center workload trends, Deep Learning (DL) research, analyst reports, competitive landscape, and token economics
Find opportunities where we uniquely can address customer needs, and translate these into compelling GPU value proposition and product proposals
Distill sophisticated analyses into clear recommendations for both technical and non-technical audiences

Qualification

GPU architectureMachine LearningDeep LearningLarge Language ModelsData center economicsProduct managementCollaborationProblem-solvingMotivation to learn

Required

5+ years of total experience in technology with previous product management, AI related engineering, design or development experience highly valued
BS or MS or equivalent experience in engineering, computer science, or another technical field. MBA a plus
Deep understanding of fundamentals of GPU architecture, Machine Learning, Deep Learning, and LLM architecture with ability to articulate relationship between application performance and GPU and data center architecture
Ability to develop intuitive models on the economics of data center workloads including data center total cost of operation and token revenues
Demonstrated ability to fully contribute to above areas within 3 months
Strong desire to learn, motivated to tackle complex problems and the ability to make sophisticated trade-offs

Preferred

2+ years direct experience in developing or deploying large scale GPU based AI applications, like LLMs, for training and inference
Ability to quickly develop intuitive, first-principles based models of Generative AI workload performance using GPU and system architecture (FLOPS, bandwidths, etc.)
Comfort and drive to constantly stay updated with the latest in deep learning research (academic papers) and industry news
Track record of managing multiple parallel efforts, collaborating with diverse teams, including performance engineers, hardware architects, and product managers

Benefits

Equity
Comprehensive benefits package

Company

NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.

H1B Sponsorship

NVIDIA has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1877)
2024 (1355)
2023 (976)
2022 (835)
2021 (601)
2020 (529)

Funding

Current Stage
Public Company
Total Funding
$4.09B
Key Investors
ARPA-EARK Investment ManagementSoftBank Vision Fund
2023-05-09Grant· $5M
2022-08-09Post Ipo Equity· $65M
2021-02-18Post Ipo Equity

Leadership Team

leader-logo
Jensen Huang
Founder and CEO
linkedin
leader-logo
Michael Kagan
Chief Technology Officer
linkedin
Company data provided by crunchbase