NVIDIA · 2 weeks ago
Senior Software Engineer, Deep Learning - Torch-TRT
NVIDIA has been transforming computer graphics and accelerated computing for over 25 years. They are seeking a Senior Deep Learning Software Engineer to develop and productize deep learning solutions for autonomous vehicles, focusing on compiler technology and optimization for NVIDIA hardware.
AI Infrastructure · Artificial Intelligence (AI) · Consumer Electronics · Foundational AI · GPU · Hardware · Software · Virtual Reality
Responsibilities
Develop compiler technologies to run various classes of model architecture (Transformer, Diffusion, VLA, CNN, RNN, etc.) on NVIDIA hardware, leveraging techniques such as reduced precision, quantization, workload scheduling, and memory bandwidth optimization
Work across the whole lifetime of a model (training, fine-tuning, and optimization) to give customers access to pioneering models on NVIDIA hardware
Develop workflows that let users leverage frameworks (e.g. PyTorch, JAX) and ecosystem tools (HuggingFace, MLIR) without forgoing performance
Stay up to date with the latest research and innovations in deep learning, and implement and experiment with new insights to improve NVIDIA's Physical AI DNNs
Qualifications
Required
MS or PhD degree in computer science, computer vision, robotics, computer architecture, or equivalent experience in a technical field
5+ years of work experience in software development
2+ years of experience implementing deep learning models and optimizations such as graph fusions, kernel implementation, KV caching, etc.
Domain experience in current innovative deep learning methods (e.g. diffusion models, vision language action models, etc.)
Strong Python and/or C/C++ programming skills
Proven technical foundation in CPU and GPU architectures, containers (nvidia-docker), numeric libraries, and modular software design
Willingness to take action and strong analytical skills
Strong time-management and organization skills for coordinating multiple initiatives, priorities, and implementations of new technology and products in very sophisticated projects
Preferred
Background in low-precision inference, quantization, and compression of DNNs
Experience optimizing GPU workloads and/or developing kernels for common DL operators
Experience with NVIDIA software libraries such as CUDA and TensorRT
In-depth experience with the internals of deep learning frameworks such as PyTorch or JAX (e.g. creating custom operators, graph fusions, deployment of PyTorch models)
Experience using current-generation kernel authoring DSLs such as Triton, cuTile, or similar
Benefits
Equity
Benefits
Company
NVIDIA
NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.
H1B Sponsorship
NVIDIA has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for your reference. (Data powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship (chart not shown; the highlighted field represents the job field similar to this job)
Trends of Total Sponsorships
2025 (1877)
2024 (1355)
2023 (976)
2022 (835)
2021 (601)
2020 (529)
Funding
Current Stage: Public Company
Total Funding: $4.09B
Key Investors: ARPA-E, ARK Investment Management, SoftBank Vision Fund
2023-05-09 · Grant · $5M
2022-08-09 · Post-IPO Equity · $65M
2021-02-18 · Post-IPO Equity
Company data provided by Crunchbase