Anthropic · 2 weeks ago
Product Manager, Safeguards
Anthropic is dedicated to creating reliable, interpretable, and steerable AI systems. The Product Manager for the Safeguards team will be responsible for the ideation, design, development, and deployment of safety systems to protect users from the risks associated with powerful AI.
Artificial Intelligence (AI), Foundational AI, Generative AI, Information Technology, Machine Learning
Responsibilities
Determine how to build in safety by design upstream and leverage downstream defenses for Anthropic’s frontier models, AI products, and customers across different surfaces: Claude.ai, the 1P API, and external cloud providers
Write safety evals and communicate externally about safety
Drive impact through ruthless prioritization: clearly define problems, lay out solution options, clarify both business & technical tradeoffs, and set clear requirements for the MVP vs. the ideal state
Align & collaborate with policy, enforcement, research, engineering, and other cross-functional stakeholders
Understand the AI landscape and ecosystem to plan for mitigation of deployment risks of increasingly powerful models and determined adversaries
Lead the development of metrics to understand the problem area, system performance, and blind spots, and to inform future project planning
Qualifications
Required
5+ years in product management with a focus on understanding problems quickly, building roadmaps with tractable progress, and getting into the details on data, detection & interventions, infrastructure & tools, and/or evals
Ability to make technical tradeoff decisions, ideally with experience working across policy experts, AI/ML research engineers, and software engineering teams to design and build state-of-the-art safety systems
Strong understanding of how users use our products, what their Safeguards concerns are, and how we can provide the best solutions
Demonstrated ability to build product and engineering strategy across multiple cross-functional teams for a rapidly changing space
Demonstrated experience in designing and building metrics to evaluate risks, system performance, and user impact, and in making crisp tradeoffs
Very strong ability to navigate and prioritize amid rapidly changing product specs, and to flex into different domains to bring clarity and execute
Evidence of exercising judgment and decision making in ambiguous situations
Experience planning, building, launching, and measuring new products / systems in a zero-to-one environment
Ability to clearly articulate complex technical concepts to non-technical audiences in written and verbal communication
Ability to think creatively about the risks and benefits of new technologies, and to think beyond past checklists and playbooks
We require at least a Bachelor's degree in a related field or equivalent experience
Benefits
Equity and benefits
Optional equity donation matching
Generous vacation and parental leave
Flexible working hours
Company
Anthropic
Anthropic is an AI research company that focuses on the safety and alignment of AI systems with human values.
H1B Sponsorship
Anthropic has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is presented below for your reference. (Data powered by the US Department of Labor)
[Chart: Distribution of Different Job Fields Receiving Sponsorship; the highlighted segment represents job fields similar to this job]
Trends of Total Sponsorships
2025 (105)
2024 (13)
2023 (3)
2022 (4)
2021 (1)
Funding
Current Stage: Late Stage
Total Funding: $33.74B
Key Investors: Lightspeed Venture Partners, Google, Amazon
2025-09-02: Series F · $13B
2025-05-16: Debt Financing · $2.5B
2025-03-03: Series E · $3.5B
Company data provided by Crunchbase