Nebius · 12 hours ago
Senior Technical Product Manager, Token Factory - Inference
Nebius is leading a new era in cloud computing to serve the global AI economy. The Senior Technical Product Manager for Token Factory will lead the definition, development, and delivery of inference capabilities, focusing on scalable, production-grade machine learning systems.
AI Infrastructure, Cloud Infrastructure, GPU, IaaS, PaaS
Responsibilities
Own the product roadmap for Nebius Token Factory inference capabilities, focusing on high-load, production-grade ML scenarios
Participate in customer PoCs covering distributed ML model deployment, inference orchestration, and optimization
Work closely with engineering and research teams to shape scalable infrastructure for real-time and batch inference
Act as the technical voice in customer conversations, translating ML workflows into product requirements
Drive product adoption by delivering tools and features that solve real-world inference problems at scale
Qualifications
Required
3–5 years of product management experience, ideally in cloud infrastructure, ML platforms, or developer tools
Strong technical foundation (e.g., Computer Science or Engineering degree) with the ability to dive deep into model architectures and serving systems
Familiarity with modern ML inference tools and frameworks (e.g., Triton Inference Server, vLLM, SGLang, TensorRT-LLM, Dynamo, KServe, Ray Serve)
Proven track record of delivering technically complex products that support distributed and high-throughput ML pipelines
Strong communicator with experience working across engineering, research, and customer-facing teams
Preferred
Deep understanding of modern ML architectures, including transformer-based models and their inference characteristics
Experience delivering or supporting ML solutions in production as part of a customer-facing or solutions role
Knowledge of MLOps or AIOps cycles, including observability, performance optimization, and continuous delivery of ML systems
Benefits
Health insurance: 100% company-paid medical, dental, and vision coverage for employees and families.
401(k) plan: Up to 4% company match with immediate vesting.
Parental leave: 20 weeks paid for primary caregivers, 12 weeks for secondary caregivers.
Remote work reimbursement: Up to $85/month for mobile and internet.
Disability & life insurance: Company-paid short-term, long-term and life insurance coverage.
Company
Nebius
The Nebius AI Cloud brings powerful full-stack infrastructure for AI developers and practitioners across startups, enterprises and science institutes to build and deploy generative AI applications and rapidly deliver scientific breakthroughs by training and running ML models within a secure, high-performance, and cost-optimized cloud environment.
Funding
Current Stage: Late Stage
Total Funding: $1.04B
2025-06-04: Debt Financing · $1B
2025-05-15: Grant · $45M
2024-12-02: Seed
Company data provided by Crunchbase