Staff Machine Learning Engineer, Virtual Collaborator jobs in United States

Anthropic · 5 hours ago

Staff Machine Learning Engineer, Virtual Collaborator

Anthropic is a public benefit corporation focused on creating reliable and beneficial AI systems. They are seeking a Staff Machine Learning Engineer to design and implement reinforcement learning environments for their AI system, Claude, specifically for virtual collaborator workflows.

Artificial Intelligence (AI), Foundational AI, Generative AI, Information Technology, Machine Learning
H1B Sponsored

Responsibilities

Designing and implementing reinforcement learning pipelines specifically targeted at virtual collaborator use cases (productivity, organizational navigation, vertical domains)
Building and scaling our data creation platform for generating high-quality, open-ended tasks with domain experts and crowdworkers
Integrating real organizational data to create authentic training environments
Developing robust rubric-based evaluation systems that maintain quality while avoiding reward hacking
Training Claude on advanced document manipulation, including understanding, enhancing, and co-creating
Partnering directly with product teams to ensure training aligns with shipped features

Qualifications

Python, Machine Learning, Reinforcement Learning, Data Pipelines, Evaluation Frameworks, Collaboration, Problem Solving, Communication Skills

Required

At least a Bachelor's degree in a related field or equivalent experience
Very experienced Python programmer who can quickly produce reliable, high-quality code
Strong machine learning experience
Thrive at the intersection of research and product, with a pragmatic approach to solving real-world problems
Comfortable with ambiguity and can balance research rigor with shipping deadlines
Enjoy collaborating across multiple teams (data operations, model training, product)
Can context-switch between research problems and product engineering tasks
Care about making AI genuinely helpful for everyday enterprise workflows

Preferred

Building human-in-the-loop training systems or crowdsourcing platforms
Working with enterprise tools and APIs (Google Workspace, Microsoft Office, Slack, etc.)
Developing evaluation frameworks for open-ended tasks
Domain expertise in finance, legal, or healthcare workflows
Creating scalable data pipelines with quality control mechanisms
Reward modeling and preventing reward hacking in RL systems
Translating product requirements into technical training objectives

Benefits

Equity and benefits
Optional equity donation matching
Generous vacation and parental leave
Flexible working hours

Company

Anthropic

Anthropic is an AI research company that focuses on the safety and alignment of AI systems with human values.

H1B Sponsorship

Anthropic has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor.)
Trends of Total Sponsorships
2025 (105)
2024 (13)
2023 (3)
2022 (4)
2021 (1)

Funding

Current Stage: Late Stage
Total Funding: $33.74B
Key Investors: Lightspeed Venture Partners, Google, Amazon
2025-09-02 · Series F · $13B
2025-05-16 · Debt Financing · $2.5B
2025-03-03 · Series E · $3.5B

Leadership Team

Dario Amodei, CEO & Co-Founder
Daniela Amodei, President & Co-Founder
Company data provided by Crunchbase