Anthropic · 5 hours ago
Staff Machine Learning Engineer, Virtual Collaborator
Anthropic is a public benefit corporation focused on creating reliable and beneficial AI systems. They are seeking a Staff Machine Learning Engineer to design and implement reinforcement learning environments for their AI system, Claude, specifically for virtual collaborator workflows.
Artificial Intelligence (AI)Foundational AIGenerative AIInformation TechnologyMachine Learning
Responsibilities
Designing and implementing reinforcement learning pipelines specifically targeted at virtual collaborator use cases (productivity, organizational navigation, vertical domains)
Building and scaling our data creation platform for generating high-quality, open-ended tasks with domain experts and crowdworkers Integrating real organizational data to create authentic training environments
Developing robust rubric-based evaluation systems that maintain quality while avoiding reward hacking
Training Claude on advanced document manipulation, including understanding, enhancing, and co-creating
Partnering directly with product teams to ensure training aligns with shipped features
Qualification
Required
At least a Bachelor's degree in a related field or equivalent experience
Very experienced Python programmer who can quickly produce reliable, high quality code
Strong machine learning experience
Thrive at the intersection of research and product, with a pragmatic approach to solving real-world problems
Comfortable with ambiguity and can balance research rigor with shipping deadlines
Enjoy collaborating across multiple teams (data operations, model training, product)
Can context-switch between research problems and product engineering tasks
Care about making AI genuinely helpful for everyday enterprise workflows
Preferred
Building human-in-the-loop training systems or crowdsourcing platforms
Working with enterprise tools and APIs (Google Workspace, Microsoft Office, Slack, etc.)
Developing evaluation frameworks for open-ended tasks
Domain expertise in finance, legal, or healthcare workflows
Creating scalable data pipelines with quality control mechanisms
Reward modeling and preventing reward hacking in RL systems
Translating product requirements into technical training objectives
Benefits
Equity and benefits
Optional equity donation matching
Generous vacation and parental leave
Flexible working hours
Company
Anthropic
Anthropic is an AI research company that focuses on the safety and alignment of AI systems with human values.
H1B Sponsorship
Anthropic has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (105)
2024 (13)
2023 (3)
2022 (4)
2021 (1)
Funding
Current Stage
Late StageTotal Funding
$33.74BKey Investors
Lightspeed Venture PartnersGoogleAmazon
2025-09-02Series F· $13B
2025-05-16Debt Financing· $2.5B
2025-03-03Series E· $3.5B
Recent News
Sherwood News
2026-01-16
The Express Tribune
2026-01-16
2026-01-16
Company data provided by crunchbase