Apply on Employer Site

Weights & Biases · 2 weeks ago

Principal Product Manager, W&B Inference - Weights & Biases

San Francisco, CA

Full-time

Hybrid

Senior Level, Lead/Staff

$206K/yr - $303K/yr

7+ years exp

Weights & Biases, acquired by CoreWeave, is creating a powerful platform for AI development and deployment. As a Principal Product Manager for the W&B Inference Service, you will define and drive the vision and execution of critical components to enhance performance, reliability, and usability for customers.

AI InfrastructureArtificial Intelligence (AI)Data VisualizationDeveloper ToolsGenerative AIMachine Learning

Comp. & Benefits

No H1B

U.S. Citizen Only

Responsibilities

Own the execution and evolution of the W&B Inference Service, delivering solutions that directly support the product vision and long-term platform strategy

Lead cross-team initiatives end-to-end, coordinating engineering, product, security, operations, and go-to-market stakeholders to ensure aligned priorities and seamless delivery across interdependent systems

Prioritize with intention, making informed trade-offs among performance, reliability, compliance, cost, and development velocity to ensure the inference service scales to meet customer and platform demands

Elevate developer and practitioner experiences by improving the operability, observability, and usability of the inference service and the tooling that surrounds it

Own execution from requirements through launch, defining success metrics, gathering customer and system insights, and ensuring every stage of development is anchored in measurable outcomes

Qualification

Product ManagementInference SystemsDistributed SystemsCloud InfrastructureModel ServingTechnical FluencyCustomer EmpathyExecution MindsetCross-functional LeadershipCollaborationProblem Solving

Required

A seasoned product manager with 7+ years working on high-scale platform or infrastructure products, with direct experience in model serving, inference systems, real-time APIs, or distributed compute services

You've worked across domains that commonly intersect with inference—including autoscaling, observability, GPU/accelerator utilization, routing/orchestration, developer tooling, IAM, and storage—and can reason about how changes ripple through a real-time serving stack

You can engage engineers on service architectures, performance bottlenecks, deployment topologies, model packaging formats, request/response patterns, and reliability trade-offs that impact low-latency inference. You're comfortable interpreting architecture diagrams and discussing how design decisions influence throughput, cost, and SLAs

Adept at coordinating teams across inference runtime, infrastructure, security, operations, and go-to-market, ensuring alignment on priorities that improve the performance, reliability, and usability of the inference service

You understand the workflows of ML practitioners running production models and the needs of internal developers building on top of the inference platform. You're motivated by uncovering friction in their serving pipelines and translating those insights into meaningful improvements

You excel in ambiguous, fast-moving environments. You bring clarity to competing priorities, make thoughtful trade-offs among latency, reliability, cost, and velocity, and consistently drive inference-focused initiatives from concept to launch

Preferred

Direct experience as a PM for an inference or model-serving service, ideally involving real-time, low-latency, or high-throughput workloads. Experience with frameworks like TensorFlow, PyTorch, or model-serialization formats is a plus

Background in adjacent platform domains such as identity & access management, billing and metering workflows, observability, or data infrastructure—especially where they intersect with running models in production

Strong familiarity with cloud infrastructure (AWS, GCP, Azure), container orchestration, autoscaling, and deployment automation tools used to operate distributed inference systems

Exposure to W&B or similar MLOps tools, especially experiment tracking, model management, or deployment workflows

Benefits

Medical, dental, and vision insurance - 100% paid for by CoreWeave

Company-paid Life Insurance

Voluntary supplemental life insurance

Short and long-term disability insurance

Flexible Spending Account

Health Savings Account

Tuition Reimbursement

Ability to Participate in Employee Stock Purchase Program (ESPP)

Mental Wellness Benefits through Spring Health

Family-Forming support provided by Carrot

Paid Parental Leave

Flexible, full-service childcare support with Kinside

401(k) with a generous employer match

Flexible PTO

Catered lunch each day in our office and data center locations

A casual work environment

A work culture focused on innovative disruption

Company

Weights & Biases

Glassdoor4.1

Weights & Biases is a developer-first MLOps platform that builds machine learning performance visualization tools.

Founded in 2017

San Francisco, California, USA

201-500 employees

https://www.wandb.ai

Funding

Current Stage

Growth Stage

Total Funding

$250M

Key Investors

NVIDIAInsight PartnersCoatue

2025-03-04Acquired

2023-09-01Secondary Market

2023-08-09Series Unknown· $50M

Leadership Team

Chris Van Pelt

Co-Founder & CISO

Shawn Lewis

Founder/CTO

Recent News

Qualcomm Ventures

Qualcomm Ventures Portfolio

2026-01-20

Dynamic Business

Weights & Biases: System of Record for the AI industry

2026-01-20

DIGIT

65% of Private AI Companies Exposed Secrets on GitHub, Report Claims

2025-11-12

Company data provided by crunchbase