NVIDIA · 2 days ago
Senior Deep Learning Software Engineer, Inference
NVIDIA is seeking a Senior Software Engineer specializing in Deep Learning Inference for their growing team. The role involves designing, building, and optimizing GPU-accelerated software for AI applications, focusing on performance improvements for large-scale model serving and inference. You will collaborate with the deep learning community to implement algorithms and enhance NVIDIA's inference libraries.
Responsibilities
Performance optimization, analysis, and tuning of DL models across domains such as LLMs, multimodal, and generative AI
Scale performance of DL models across different architectures and types of NVIDIA accelerators
Contribute features and code to NVIDIA’s inference libraries, vLLM, SGLang, FlashInfer, and LLM software solutions
Collaborate with teams across frameworks, NVIDIA libraries, and inference optimization solutions
Qualifications
Required
Master's or PhD, or equivalent experience, in a relevant field (Computer Engineering, Computer Science, EECS, AI)
5+ years of relevant software development experience
Excellent C/C++ programming and software design skills
Agile software development skills are helpful, and Python experience is a plus
Preferred
Prior experience with training, deploying or optimizing the inference of DL models in production is a plus
Prior background with performance modeling, profiling, debugging, and code optimization, or architectural knowledge of CPUs and GPUs, is a plus
GPU programming experience (CUDA, OpenAI Triton, or CUTLASS) is a plus
Experience with multi-GPU communication (NCCL, NVSHMEM)
Benefits
Equity
Benefits
Company
NVIDIA
NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.
H1B Sponsorship
NVIDIA has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is presented below for your reference. (Data powered by the US Department of Labor)
[Chart: Distribution of Different Job Fields Receiving Sponsorship]
Trends of Total Sponsorships
2025 (1877)
2024 (1355)
2023 (976)
2022 (835)
2021 (601)
2020 (529)
Funding
Current Stage
Public Company
Total Funding
$4.09B
Key Investors
ARPA-E
ARK Investment Management
SoftBank Vision Fund
2023-05-09 Grant · $5M
2022-08-09 Post-IPO Equity · $65M
2021-02-18 Post-IPO Equity
Recent News
2026-01-25
Unified Communications fuel big enterprise success | CIO
Company data provided by Crunchbase