Weights & Biases · 2 weeks ago
Principal Product Manager, W&B Inference - Weights & Biases
Weights & Biases, acquired by CoreWeave, is creating a powerful platform for AI development and deployment. As a Principal Product Manager for the W&B Inference Service, you will define and drive the vision and execution of critical components to enhance performance, reliability, and usability for customers.
AI Infrastructure · Artificial Intelligence (AI) · Data Visualization · Developer Tools · Generative AI · Machine Learning
Responsibilities
Own the execution and evolution of the W&B Inference Service, delivering solutions that directly support the product vision and long-term platform strategy
Lead cross-team initiatives end-to-end, coordinating engineering, product, security, operations, and go-to-market stakeholders to ensure aligned priorities and seamless delivery across interdependent systems
Prioritize with intention, making informed trade-offs among performance, reliability, compliance, cost, and development velocity to ensure the inference service scales to meet customer and platform demands
Elevate developer and practitioner experiences by improving the operability, observability, and usability of the inference service and the tooling that surrounds it
Own execution from requirements through launch, defining success metrics, gathering customer and system insights, and ensuring every stage of development is anchored in measurable outcomes
Qualifications
Required
A seasoned product manager with 7+ years working on high-scale platform or infrastructure products, with direct experience in model serving, inference systems, real-time APIs, or distributed compute services
You've worked across domains that commonly intersect with inference—including autoscaling, observability, GPU/accelerator utilization, routing/orchestration, developer tooling, IAM, and storage—and can reason about how changes ripple through a real-time serving stack
You can engage engineers on service architectures, performance bottlenecks, deployment topologies, model packaging formats, request/response patterns, and reliability trade-offs that impact low-latency inference. You're comfortable interpreting architecture diagrams and discussing how design decisions influence throughput, cost, and SLAs
Adept at coordinating teams across inference runtime, infrastructure, security, operations, and go-to-market, ensuring alignment on priorities that improve the performance, reliability, and usability of the inference service
You understand the workflows of ML practitioners running production models and the needs of internal developers building on top of the inference platform. You're motivated by uncovering friction in their serving pipelines and translating those insights into meaningful improvements
You excel in ambiguous, fast-moving environments. You bring clarity to competing priorities, make thoughtful trade-offs among latency, reliability, cost, and velocity, and consistently drive inference-focused initiatives from concept to launch
Preferred
Direct experience as a PM for an inference or model-serving service, ideally involving real-time, low-latency, or high-throughput workloads. Experience with frameworks like TensorFlow, PyTorch, or model-serialization formats is a plus
Background in adjacent platform domains such as identity & access management, billing and metering workflows, observability, or data infrastructure—especially where they intersect with running models in production
Strong familiarity with cloud infrastructure (AWS, GCP, Azure), container orchestration, autoscaling, and deployment automation tools used to operate distributed inference systems
Exposure to W&B or similar MLOps tools, especially experiment tracking, model management, or deployment workflows
Benefits
Medical, dental, and vision insurance - 100% paid for by CoreWeave
Company-paid Life Insurance
Voluntary supplemental life insurance
Short and long-term disability insurance
Flexible Spending Account
Health Savings Account
Tuition Reimbursement
Ability to Participate in Employee Stock Purchase Program (ESPP)
Mental Wellness Benefits through Spring Health
Family-Forming support provided by Carrot
Paid Parental Leave
Flexible, full-service childcare support with Kinside
401(k) with a generous employer match
Flexible PTO
Catered lunch each day in our office and data center locations
A casual work environment
A work culture focused on innovative disruption
Company
Weights & Biases
Weights & Biases is a developer-first MLOps platform that provides tools for visualizing machine learning performance.
Funding
Current Stage
Growth Stage
Total Funding
$250M
Key Investors
NVIDIA · Insight Partners · Coatue
2025-03-04 · Acquired
2023-09-01 · Secondary Market
2023-08-09 · Series Unknown · $50M
Recent News
Qualcomm Ventures · 2026-01-20
Dynamic Business · 2026-01-20
Company data provided by Crunchbase