NVIDIA · 6 hours ago
Senior Deep Learning Inference Performance Architect
NVIDIA is seeking a Senior Deep Learning Inference Performance Architect who will focus on accelerating AI Inference workloads through innovative hardware-software co-design. The role involves writing performance optimized code, analyzing deep learning algorithms, and collaborating with various teams to guide AI direction.
Responsibilities
Develop innovative GPU and system architectures to extend the state of the art in AI Inference performance and efficiency
Model, analyze and prototype key deep learning algorithms and applications
Understand and analyze the interplay of hardware and software architectures on future algorithms and applications
Write efficient software for AI Inference, including CUDA kernels, framework level code, and application level code
Collaborate across the company to guide the direction of AI, working with software, research and product teams
Qualification
Required
A MS or PhD in a relevant discipline (CS, EE, Math) or equivalent experience, with 5+ years or relevant experience
Strong mathematical foundation in machine learning and deep learning
Expert programming skills in C, C++, and Python
Familiarity with GPU computing (CUDA or similar) and HPC (MPI, OpenMP)
Strong knowledge and coursework in computer architecture
Preferred
Background with systems-level performance modeling, profiling, and analysis
Experience in characterizing and modeling system-level performance, executing comparison studies, and documenting and publishing results
Experience in optimizing AI Inference workloads with CUDA kernel development
Benefits
Equity
Benefits
Company
NVIDIA
NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.
H1B Sponsorship
NVIDIA has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1877)
2024 (1355)
2023 (976)
2022 (835)
2021 (601)
2020 (529)
Funding
Current Stage
Public CompanyTotal Funding
$4.09BKey Investors
ARPA-EARK Investment ManagementSoftBank Vision Fund
2023-05-09Grant· $5M
2022-08-09Post Ipo Equity· $65M
2021-02-18Post Ipo Equity
Recent News
2026-01-01
2026-01-01
Business Insider
2026-01-01
Company data provided by crunchbase