
NVIDIA · 8 hours ago

Model-as-a-Service Tech Lead

NVIDIA has been transforming computer graphics and accelerated computing for over 25 years, and they are seeking a highly qualified Tech Lead to drive the technical vision and implementation of their web-based platform. This role involves significant hands-on coding and leadership in building scalable solutions for autonomous driving scenarios and synthetic data generation.

Artificial Intelligence (AI), Consumer Electronics, GPU, Hardware, Software, Virtual Reality
Growth Opportunities
H1B Sponsor Likely

Responsibilities

Serve as the primary, high-impact contributor on complex features. Dedicate significant time to producing production code across the full stack, including UI, APIs, services, and infrastructure
Code Review Leadership & Quality Assurance: Lead the code review process, setting and enforcing rigorous coding standards, performance benchmarks, and architectural integrity requirements to ensure all merged code is high-quality, maintainable, and robust
Architectural Ownership & Portability: Define and own the long-term technical roadmap, architecture, and design. This includes ensuring that deployment pipelines and services are platform-agnostic and easily deployable across the broader NVIDIA ecosystem, deliberately avoiding dependencies on internal infrastructure
Foundation Model Deployment Strategy: Lead the strategic implementation of web services and efficient batch processing queues to seamlessly integrate and operationalize our world foundation models into the customer-facing platform
System Performance & Reliability: Define and enforce standards for production-grade performance, monitoring, and fault tolerance across all services. Proactively identify and resolve systemic technical debt and scalability bottlenecks
Deployment & Operational Excellence: Take ultimate ownership of the CI/CD pipelines, container orchestration strategy (Kubernetes/Helm), and operational readiness, ensuring seamless scalability and reliability in production
Team Mentorship & Guidance: Mentor and guide the engineering team on advanced practices in full-stack development, distributed systems design, performance optimization, and clean, portable code architecture
Multi-functional Partnership: Act as the key technical liaison, translating complex requirements from Product Managers, ML Engineers, and Data Scientists into robust, portable, and implementable designs

Qualifications

Full-stack development, Distributed microservices architecture, Container orchestration, Cloud Infrastructure, CI/CD practices, Python, Go, TypeScript, MaaS patterns, Team mentorship, Architectural design, Performance optimization, Data querying platforms, Autonomous vehicle simulation

Required

12+ years of hands-on experience developing and deploying scalable full-stack web services in a cloud environment
Proven Tech Lead or equivalent Senior/Staff level experience with demonstrated ability to define system architecture, mentor engineers, and take end-to-end technical ownership of a major platform while remaining deeply active in coding and code reviews
Expert-level proficiency in designing and scaling distributed microservices architectures using gRPC and REST APIs
Deep expertise in modern frontend frameworks and building highly responsive, data-intensive UIs capable of managing high-frequency data flows
Direct experience designing and deploying containerized applications that use a GPU (e.g., NVIDIA Container Toolkit)
Experience with MaaS (Model-as-a-Service) patterns and serving large machine learning models as high-throughput endpoints
Mastery of container orchestration, including Kubernetes and Helm for sophisticated, portable, multi-service production deployments
Proficiency in backend languages such as Python and/or Go, and TypeScript for the frontend
Strong practical experience with Cloud Infrastructure (AWS S3) and managing complex data storage/access patterns (SQL, key-value stores)
Expertise in CI/CD practices (GitLab, Jenkins) with a focus on automation, testing, and improving deployment velocity and stability
Bachelor's degree (B.S.) or equivalent experience in Computer Science, Software Engineering, Electrical Engineering, or a closely related technical field

Preferred

Master's degree (M.S.)
Experience in data querying platforms such as Apache Druid, ClickHouse, or Elasticsearch
Familiarity with autonomous vehicle simulation environments (e.g., CARLA) and synthetic data generation pipelines using foundation models

Benefits

Equity
Benefits

Company

NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.

H1B Sponsorship

NVIDIA has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data from the US Department of Labor)
Trends of Total Sponsorships
2025 (1418)
2024 (1356)
2023 (976)
2022 (835)
2021 (601)
2020 (529)

Funding

Current Stage
Public Company
Total Funding
$4.09B
Key Investors
ARPA-E, ARK Investment Management, SoftBank Vision Fund
2023-05-09 · Grant · $5M
2022-08-09 · Post-IPO Equity · $65M
2021-02-18 · Post-IPO Equity

Leadership Team

Jensen Huang
Founder and CEO
Michael Kagan
Chief Technology Officer
Company data provided by Crunchbase