Protege · 2 weeks ago
Product Manager, Data Lab
Protege is building a platform to solve the biggest unmet need in AI by facilitating the secure and efficient exchange of training data. The Product Manager, Data Lab will translate cutting-edge AI research into scalable product capabilities, working closely with research scientists and ML engineers to enhance experimentation and product features.
AnalyticsArtificial Intelligence (AI)Data Management
Responsibilities
Partner closely with Data Lab scientists to understand how models are being trained, evaluated, and iterated today
Translate experimental workflows (data curation, labeling, evaluation, fine-tuning, feedback loops) into scalable product and platform capabilities
Identify patterns across experiments that are worth standardizing versus those that should remain bespoke
Lead product discovery and execution for internal tools that support modern AI development:
Dataset versioning
Evaluation pipelines
Annotation and human-in-the-loop workflows
Experiment tracking and reproducibility
Ensure tooling reflects real-world frontier practices, not academic abstractions
Serve as the primary product interface for the Data Lab
Translate research intuition into product requirements engineers can build against
Help researchers reason about tradeoffs between novelty, robustness, and scalability
Collaborate with Platform and Vertical PMs to ensure new capabilities integrate cleanly into customer-facing products
Decide when an experimental capability is ready to move from 'research mode' to 'product mode'
Apply an 80/20 mindset without undermining scientific rigor
Sunset or deprioritize tools and ideas that do not meaningfully advance AI development velocity or data quality
Define success metrics tied to:
Experiment cycle time
Researcher productivity
Adoption of internal tools
Downstream impact on customer data products
Use qualitative and quantitative feedback to continuously iterate
Qualification
Required
5+ years of product management experience, ideally in AI/ML platforms, developer tools, data infrastructure, or internal research tooling
Strong experience working with highly technical stakeholders
Proven ability to lead ambiguous, zero-to-one initiatives
Hands-on or adjacent experience with how frontier AI models are built today — including large-scale training, fine-tuning, evaluation, and data iteration
Understanding of concepts like training data quality vs quantity tradeoffs, evaluation benchmarks vs real-world performance, human feedback loops, multimodal data challenges
Ability to have credible conversations with PhD-level researchers and senior ML engineers
Ability to turn complex technical systems into clear product decisions
Excellent communicator across research, engineering, and product
Comfortable influencing without authority
Bias toward shipping, learning, and iterating
Preferred
Prior experience working with or adjacent to frontier model builders
Experience with multimodal AI systems (text, audio, video, healthcare data)
Background in ML engineering, data science, or applied research before PM
Benefits
Competitive compensation
Equity
Benefits
Company
Protege
Protege is the AI training data platform enabling seamless and compliant data exchange.
Funding
Current Stage
Early StageTotal Funding
$65MKey Investors
Andreessen HorowitzFootworkCRV
2026-01-07Series A· $30M
2025-08-13Series A· $25M
2024-09-10Seed· $10M
Recent News
2026-01-11
2026-01-09
Company data provided by crunchbase