Lead Engineer, Inference Platform jobs in United States
cer-icon
Apply on Employer Site
company-logo

MongoDB · 3 months ago

Lead Engineer, Inference Platform

MongoDB is a company that empowers innovators to create and disrupt industries through software and data. They are seeking a Lead Engineer for the Inference Platform, responsible for building and optimizing the infrastructure for embedding models that enhance semantic search and AI capabilities within MongoDB Atlas.

Cloud ComputingDatabaseOpen SourceSaaSSoftware
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Partner with Search Platform and Voyage.ai AI engineers and researchers to productionize state-of-the-art embedding models and rerankers, supporting both batch and real-time inference
Lead key projects around performance optimization, GPU utilization, autoscaling, and observability for the inference platform
Design and build components of a multi-tenant inference service that integrates with Atlas Vector Search, driving capabilities for semantic search and hybrid retrieval
Contribute to platform features like model versioning, safe deployment pipelines, latency-aware routing, and model health monitoring
Collaborate with peers across ML, infra, and product teams to define architectural patterns and operational practices that support high availability and low latency at scale
Guide decisions on model serving architecture using tools like vLLM, ONNX Runtime, and container orchestration in Kubernetes
Provide technical leadership and mentorship to junior engineers, fostering a culture of technical excellence and continuous improvement within the team

Qualification

Embedding modelsPerformance optimizationCloud-native systemsMulti-tenant environmentsGoRustC++PythonInference runtimesHigh-scale SaaS infrastructureTechnical leadershipCollaboration skillsMentorship

Required

8+ years of engineering experience in backend systems, ML infrastructure, or scalable platform development, and the ability to provide technical leadership and guidance to a team of engineers
Expertise in serving embedding models in production environments
Strong systems skills in languages like Go, Rust, C++, or Python, and experience profiling and optimizing performance
Comfortable working on cloud-native distributed systems, with a focus on latency, availability, and observability
Familiarity with inference runtimes and vector search systems (e.g., Faiss, HNSW, ScaNN)
Proven ability to collaborate across disciplines and experience levels, from ML researchers to junior engineers
Experience with high-scale SaaS infrastructure, particularly in multi-tenant environments
1+ years of experience serving as TL for a large-scale ML inference or training platform SW project

Preferred

Prior experience working with model teams on inference-optimized architectures
Background in hybrid retrieval, prompt-based pipelines, or retrieval-augmented generation (RAG)
Contributions to relevant open-source ML serving infrastructure
1+ years of experience in managing a technical team focused on ML inference or training infrastructure

Benefits

Equity
Participation in the employee stock purchase program
Flexible paid time off
20 weeks fully-paid gender-neutral parental leave
Fertility and adoption assistance
401(k) plan
Mental health counseling
Access to transgender-inclusive health insurance coverage
Health benefits offerings

Company

MongoDB is a next-generation database that helps businesses transform their industries by harnessing the power of data.

H1B Sponsorship

MongoDB has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (159)
2024 (150)
2023 (133)
2022 (79)
2021 (51)
2020 (30)

Funding

Current Stage
Public Company
Total Funding
$311M
Key Investors
SalesforceEquityZenT. Rowe Price
2024-11-14Post Ipo Equity
2024-10-16Post Ipo Debt
2018-03-06Post Ipo Equity

Leadership Team

leader-logo
Jim Scharf
Chief Technology Officer
linkedin
leader-logo
Ronnen Miller
SVP, Global Technical Services
Company data provided by crunchbase