NVIDIA · 3 days ago
Principal Software Engineer, Profiling Services
NVIDIA, a leading technology company focused on GPU technologies, is seeking a Principal Software Engineer for its Profiling Services team. The role involves designing and delivering a low-overhead GPU profiling service that integrates with ML workloads, leading architectural decisions, and mentoring engineers.
Artificial Intelligence (AI) · Consumer Electronics · GPU · Hardware · Software · Virtual Reality
Responsibilities
Own the architecture for an always‑on profiling service, defining interfaces, data flows, and scalability guarantees for multi‑process, multi‑GPU, and multi‑node systems
Drive low‑overhead, high‑reliability implementations in C/C++, including IPC/shared memory, lock‑free buffers, and bounded CPU/memory budgets with clear benchmarks
Lead end‑to‑end feature delivery spanning user‑mode components, driver/platform layers, and performance counter/trace providers
Establish profiling models that integrate with existing ML/AI workflows (e.g., PyTorch/XLA) to turn low‑level signals into actionable insights
Set technical direction for an engineering team; mentor engineers, drive technical planning to mitigate architectural risks, and align roadmaps across internal and external partners
Qualifications
Required
BS or MS degree, or equivalent experience, in Computer Engineering, Computer Science, or a related field
15+ years of system‑level C/C++ development, including concurrency, memory management, and performance engineering
Expertise with profiling/tracing stacks for CPU/GPU (e.g., CUPTI, Nsight, performance counters, event correlation) and debugging concurrent systems
Deep hands‑on CUDA and GPU architecture knowledge (runtime/driver APIs, CUDA streams/graphs, kernel behavior)
Proven experience designing and shipping production quality system software or drivers with strict reliability, observability, and performance constraints
Demonstrated technical leadership: defining architecture and success metrics, and translating abstract product visions into actionable technical roadmaps for fast-paced, multidisciplinary teams
Strong interpersonal, verbal, and written communication; able to influence across organizations and build trust with external collaborators
Preferred
Track record building continuous/always‑on or multi‑client profiling systems with predictable overhead at scale
Hands-on experience tuning ML training/inference loops based on deep profiling analysis
Familiarity with ML ecosystems (e.g., PyTorch, JAX) and correlating application‑level events with GPU traces/metrics
Strong background in translating profiling data into actionable performance insights (compute vs memory bound, bottleneck triage)
Experience with user‑mode driver development and integration with platform permissions/security models
Benefits
Equity
Benefits
Company
NVIDIA
NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.
H1B Sponsorship
NVIDIA has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is presented below for your reference. (Data powered by US Department of Labor)
Trends of Total Sponsorships
2025 (1877)
2024 (1355)
2023 (976)
2022 (835)
2021 (601)
2020 (529)
Funding
Current Stage: Public Company
Total Funding: $4.09B
Key Investors: ARPA-E, ARK Investment Management, SoftBank Vision Fund
2023-05-09 · Grant · $5M
2022-08-09 · Post-IPO Equity · $65M
2021-02-18 · Post-IPO Equity
Company data provided by Crunchbase