MongoDB · 3 months ago
Lead Engineer, Inference Platform
MongoDB is a company that empowers innovators to create and disrupt industries through software and data. They are seeking a Lead Engineer for the Inference Platform, responsible for building and optimizing the infrastructure for embedding models that enhance semantic search and AI capabilities within MongoDB Atlas.
Cloud ComputingDatabaseOpen SourceSaaSSoftware
Responsibilities
Partner with Search Platform and Voyage.ai AI engineers and researchers to productionize state-of-the-art embedding models and rerankers, supporting both batch and real-time inference
Lead key projects around performance optimization, GPU utilization, autoscaling, and observability for the inference platform
Design and build components of a multi-tenant inference service that integrates with Atlas Vector Search, driving capabilities for semantic search and hybrid retrieval
Contribute to platform features like model versioning, safe deployment pipelines, latency-aware routing, and model health monitoring
Collaborate with peers across ML, infra, and product teams to define architectural patterns and operational practices that support high availability and low latency at scale
Guide decisions on model serving architecture using tools like vLLM, ONNX Runtime, and container orchestration in Kubernetes
Provide technical leadership and mentorship to junior engineers, fostering a culture of technical excellence and continuous improvement within the team
Qualification
Required
8+ years of engineering experience in backend systems, ML infrastructure, or scalable platform development, and the ability to provide technical leadership and guidance to a team of engineers
Expertise in serving embedding models in production environments
Strong systems skills in languages like Go, Rust, C++, or Python, and experience profiling and optimizing performance
Comfortable working on cloud-native distributed systems, with a focus on latency, availability, and observability
Familiarity with inference runtimes and vector search systems (e.g., Faiss, HNSW, ScaNN)
Proven ability to collaborate across disciplines and experience levels, from ML researchers to junior engineers
Experience with high-scale SaaS infrastructure, particularly in multi-tenant environments
1+ years of experience serving as TL for a large-scale ML inference or training platform SW project
Preferred
Prior experience working with model teams on inference-optimized architectures
Background in hybrid retrieval, prompt-based pipelines, or retrieval-augmented generation (RAG)
Contributions to relevant open-source ML serving infrastructure
1+ years of experience in managing a technical team focused on ML inference or training infrastructure
Benefits
Equity
Participation in the employee stock purchase program
Flexible paid time off
20 weeks fully-paid gender-neutral parental leave
Fertility and adoption assistance
401(k) plan
Mental health counseling
Access to transgender-inclusive health insurance coverage
Health benefits offerings
Company
MongoDB
MongoDB is a next-generation database that helps businesses transform their industries by harnessing the power of data.
H1B Sponsorship
MongoDB has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (159)
2024 (150)
2023 (133)
2022 (79)
2021 (51)
2020 (30)
Funding
Current Stage
Public CompanyTotal Funding
$311MKey Investors
SalesforceEquityZenT. Rowe Price
2024-11-14Post Ipo Equity
2024-10-16Post Ipo Debt
2018-03-06Post Ipo Equity
Recent News
Investing.com
2026-01-06
2025-12-15
Company data provided by crunchbase