Senior Technical Product Manager Token Factory - Inference jobs in United States
cer-icon
Apply on Employer Site
company-logo

Nebius · 12 hours ago

Senior Technical Product Manager Token Factory - Inference

Nebius is leading a new era in cloud computing to serve the global AI economy. The Senior Technical Product Manager for Token Factory will lead the definition, development, and delivery of inference capabilities, focusing on scalable, production-grade machine learning systems.

AI InfrastructureCloud InfrastructureGPUIaaSPaaS
check
Growth Opportunities

Responsibilities

Own the product roadmap for Nebius Token Factory inference capabilities, focusing on high-load, production-grade ML scenarios
Be involved in customer PoCs involving distributed ML model deployment, inference orchestration, and optimization
Work closely with engineering and research teams to shape scalable infrastructure for real-time and batch inference
Act as the technical voice in customer conversations, translating ML workflows into product requirements
Drive product adoption by delivering tools and features that solve real-world inference problems at scale

Qualification

Product managementMachine learning systemsCloud infrastructureML inference toolsMLOps knowledgeCommunicatorTechnical foundation

Required

3–5 years of product management experience, ideally in cloud infrastructure, ML platforms, or developer tools
Strong technical foundation (e.g. Computer Science or Engineering degree) with ability to dive deep into model architectures and serving systems
Familiarity with modern ML inference tools and frameworks (e.g., Triton Inference Server, vLLM, SGLang, TensorRT-LLM, Dynamo, KServe, Ray Serve)
Proven track record of delivering technically complex products that support distributed and high-throughput ML pipelines
Strong communicator with experience working across engineering, research, and customer-facing teams

Preferred

Deep understanding of modern ML architectures, including transformer-based models and their inference characteristics
Experience delivering or supporting ML solutions in production as part of a customer-facing or solutions role
Knowledge of MLOps or AIOps cycles, including observability, performance optimization, and continuous delivery of ML systems

Benefits

Health insurance: 100% company-paid medical, dental, and vision coverage for employees and families.
401(k) plan: Up to 4% company match with immediate vesting.
Parental leave: 20 weeks paid for primary caregivers, 12 weeks for secondary caregivers.
Remote work reimbursement: Up to $85/month for mobile and internet.
Disability & life insurance: Company-paid short-term, long-term and life insurance coverage.

Company

Nebius

twittertwittertwitter
company-logo
The Nebius AI Cloud brings powerful full-stack infrastructure for AI developers and practitioners across startups, enterprises and science institutes to build and deploy generative AI applications and rapidly deliver scientific breakthroughs by training and running ML models within a secure, high-performance, and cost-optimized cloud environment.

Funding

Current Stage
Late Stage
Total Funding
$1.04B
2025-06-04Debt Financing· $1B
2025-05-15Grant· $45M
2024-12-02Seed

Leadership Team

E
Evan Helda
Head of Physical AI
linkedin
leader-logo
Vinita Ananth
Sr. Director of Product
linkedin
Company data provided by crunchbase