Senior Software Engineer, Deep Learning - Torch-TRT jobs in United States
cer-icon
Apply on Employer Site
company-logo

NVIDIA · 2 weeks ago

Senior Software Engineer, Deep Learning - Torch-TRT

NVIDIA has been transforming computer graphics and accelerated computing for over 25 years. They are seeking a Senior Deep Learning Software Engineer to develop and productize deep learning solutions for autonomous driving vehicles, focusing on compiler technology and optimization for NVIDIA hardware.

AI InfrastructureArtificial Intelligence (AI)Consumer ElectronicsFoundational AIGPUHardwareSoftwareVirtual Reality
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Developing compiler technologies to run various classes of model architecture (Transformer, Diffusion, VLA, CNN, RNN etc.) on NVIDIA hardware leveraging techniques such as reduced precision, quantization, workload scheduling and memory bandwidth optimization
Working across the whole lifetime of a model: training, fine-tuning, optimization to allow customers to access pioneering models on NVIDIA hardware
Develop workflows that let users leverage frameworks (e.g. PyTorch, JAX) and ecosystem tools (HuggingFace, MLIR) without forgoing performance
Stay up to date with the latest research and innovations in deep learning, implement and experiment with new insights to improve NVIDIA's Physical AI DNNs

Qualification

Deep LearningPythonC/C++GPU ArchitecturesCompiler TechnologiesModel OptimizationNVIDIA Software LibrariesAnalytical SkillsTime ManagementOrganization Skills

Required

MS or PhD degree in computer science, computer vision, robotics, computer architecture or equivalent experience in technical field
5+ years of work experience in software development
2+ years of experience in implementing deep learning models and optimizations such as graph fusions, kernel implementation, KV Caching etc
Domain experience in current innovative deep learning methods (e.g. diffusion models, vision language action models, etc.)
Strong Python and/or C/C++ programming skills
Proven technical foundation in CPU and GPU architectures, containers (nvidia-docker), numeric libraries, modular software design
Willing to take action and have strong analytical skills
Strong time-management and organization skills for coordinating multiple initiatives, priorities and implementations of new technology and products into very sophisticated projects

Preferred

Background with low precision inference, quantization, compression of DNNs
Experience optimizing GPU workloads and or developing kernels for common DL operators
Experience with NVIDIA software libraries such as CUDA and TensorRT
In depth experience with the internals of deep learning frameworks such as PyTorch or JAX (e.g. Creating custom operators, Graph fusions, deployment of PyTorch models)
Experience using current generation kernel authoring DSLs such as Triton or cuTile or similar

Benefits

Equity
Benefits

Company

NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.

H1B Sponsorship

NVIDIA has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1877)
2024 (1355)
2023 (976)
2022 (835)
2021 (601)
2020 (529)

Funding

Current Stage
Public Company
Total Funding
$4.09B
Key Investors
ARPA-EARK Investment ManagementSoftBank Vision Fund
2023-05-09Grant· $5M
2022-08-09Post Ipo Equity· $65M
2021-02-18Post Ipo Equity

Leadership Team

leader-logo
Jensen Huang
Founder and CEO
linkedin
leader-logo
Michael Kagan
Chief Technology Officer
linkedin
Company data provided by crunchbase