NVIDIA · 3 days ago
Senior Manager, CSP Engagements – System Software SWAT Team
NVIDIA is seeking a Senior Manager to lead its System Software SWAT Team within CSP Engagements, focusing on data center platforms. The role involves leading a cross-functional team to address complex system software issues for hyperscaler customers and ensuring high-quality outcomes through effective incident response and management.
Responsibilities
Lead a cross-functional SWAT team focused on rapid triage, debugging, and resolution of complex system software issues for hyperscaler customers
Drive technical incident response, war-room operations, and escalation management across firmware, Linux kernel, drivers, networking, virtualization, and observability layers
Build and mentor a high-performing team of senior engineers; set operational standards for incident response, on-call rotations, and continuous improvement
Serve as a primary technical and operational focal point for hyperscaler customers, managing expectations, communications, and participant relationships
Collaborate with CSP technical leads, TPMs, and internal engineering teams to deliver customer-validated solutions and influence product quality and release criteria
Operate customer-like labs to reproduce issues, validate fixes, and ensure robust telemetry and observability
Provide executive-level status updates, risk assessments, and recommendations for critical customer issues
Qualifications
Required
12+ years of overall proven experience in system software (firmware, Linux kernel, drivers, networking, virtualization), with at least 5 years in data center or HPC software environments
Bachelor's degree or equivalent experience
3+ years of direct experience working with hyperscalers in production environments
6+ years of management experience
Proven leadership in managing customer escalations, technical incident response, and cross-functional teams
Deep technical expertise in Linux kernel, device drivers, ARM (aarch64) & x86, OpenBMC/SBIOS, out-of-band/in-band management, DMTF protocols (Redfish, PLDM, MCTP, SPDM), and networking (TCP/IP, Ethernet, InfiniBand)
Strong customer management and team member engagement skills; ability to communicate complex technical issues to executive and engineering audiences
Demonstrated success in reducing time-to-mitigation, improving release predictability, and driving continuous improvement in technical operations
Preferred
Experience building and operating customer-like labs, automation, and telemetry frameworks
Familiarity with GPU computing (CUDA), large-scale AI/HPC workloads, NVLink, Grace, and cluster-level deployment/management
Knowledge of CXL/memory fabric fundamentals and contributions to industry standards (OCP, DMTF)
Benefits
Equity
Benefits
Company
NVIDIA
NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.
H1B Sponsorship
NVIDIA has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2025 (1877)
2024 (1355)
2023 (976)
2022 (835)
2021 (601)
2020 (529)
Funding
Current Stage: Public Company
Total Funding: $4.09B
Key Investors: ARPA-E, ARK Investment Management, SoftBank Vision Fund
2023-05-09 · Grant · $5M
2022-08-09 · Post-IPO Equity · $65M
2021-02-18 · Post-IPO Equity
Recent News
Business Insider
2026-01-09
Company data provided by Crunchbase