SIGN IN
Director, Software Engineering - NIM Factory jobs in United States
cer-icon
Apply on Employer Site
company-logo

NVIDIA · 1 day ago

Director, Software Engineering - NIM Factory

NVIDIA is a leader in AI-powered applications, seeking a strategic, technically proficient Senior Manager of Software Engineering to lead the NVIDIA Inference Microservices (NIM) Factory group. This role involves directing a team to establish NIM as the global standard for AI inference, overseeing technical strategy, and ensuring operational excellence across various environments.
AI InfrastructureArtificial Intelligence (AI)Consumer ElectronicsFoundational AIGPUHardwareSoftwareVirtual Reality
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Set the Strategy: Define the multi‑year technical strategy and roadmap to establish NIM as the universal runtime and distribution standard for AI inference. Ensure we can build, ship, and operate NIMs at exponential scale while setting the industry bar for ease of use and performance
Manage the Portfolio: Develop the operating model for NIM Factory—OKRs, roadmap governance, technical standards, and cross-org dependency coordination—to achieve consistent results across various complex workstreams
Lead Leaders: Guide the NIM Factory engineering organization (managing managers, senior managers, and senior technical leaders) across containers, orchestration, workflow, observability, and platform APIs. Develop organizational structure, succession plans, and leadership skills as the organization expands
Accelerate “Day‑0” or equivalent experience Model Delivery: Manage the factory systems and platform capabilities to package and optimize the latest modern models right away. This bridges powerful research and enterprise production demands
Establish the Operational Standard: Define and guide the full platform standards that transform models, optimized runtimes, and validated configurations into standardized, enterprise-supported containers. Customers can deploy these containers across cloud, data center, and edge environments
Own Reliability, Security, and Compliance Outcomes: Partner with SRE and security leadership to set SLOs, establish durable incident/postmortem and release practices, and ensure NIMs meet enterprise expectations for availability, performance, and supply‑chain rigor
Champion Open Source and Standards: Lead our upstream strategy and partnerships to build standards in containerization, orchestration, and inference. Ensure NVIDIA contributes meaningfully and is an outstanding citizen in the ecosystem
Own cost efficiency, capacity strategy, and operational health for the global factory. Ensure we invest in the right capabilities and remove bottlenecks to growth

Qualification

Cloud-native engineeringEngineering managementAI inferenceOpen source leadershipLarge-scale LLM platformsCI/CD platformsCollaboration managementCommunicationCritical thinking

Required

15+ years of experience building and delivering production software systems. This includes 8+ years in engineering management and 3+ years managing managers at the Director level or equivalent
Proven track record of leading large engineering organizations (50+ engineers) and driving complex, multi-functional programs from inception to successful production launch and scale
Deep technical understanding of cloud‑native engineering (containers, Kubernetes, microservices) and modern SDLC practices; ability to dive deep into architecture and code when necessary
Strong critical thinking and business insight; ability to translate high-level business goals into actionable engineering strategies
Excellent communication and collaborator management; ability to influence executive leadership across product, research, security, and operations
A degree in Computer Science, Computer Engineering, or a related field (BS or MS) or equivalent experience

Preferred

Open Source Leadership: Significant contributions to or leadership of major open source projects in the AI/ML or cloud-native landscape (e.g., CNCF projects, Hugging Face ecosystem)
Led organizations that built and operated large‑scale LLM inference or model‑serving platforms (Triton, TensorRT‑LLM, vLLM, KServe) in production
Experience architecting next-generation container build systems or CI/CD platforms at extensive scale
Built and managed globally distributed organizations; established durable engineering processes that significantly improved quality and velocity across multiple teams
Recognized industry leader with contributions to open‑source ecosystems, technical publications, or talks in containers, Kubernetes, GPU, or inference communities

Benefits

Equity
Benefits

Company

NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.

H1B Sponsorship

NVIDIA has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1877)
2024 (1355)
2023 (976)
2022 (835)
2021 (601)
2020 (529)

Funding

Current Stage
Public Company
Total Funding
$4.09B
Key Investors
ARPA-EARK Investment ManagementSoftBank Vision Fund
2023-05-09Grant· $5M
2022-08-09Post Ipo Equity· $65M
2021-02-18Post Ipo Equity

Leadership Team

leader-logo
Jensen Huang
Founder and CEO
linkedin
leader-logo
Michael Kagan
Chief Technology Officer
linkedin
Company data provided by crunchbase