Anthropic · 5 hours ago

Staff Machine Learning Engineer, Virtual Collaborator

NYC Metro Area

Full-time

Hybrid

Lead/Staff

$500K/yr - $850K/yr

Anthropic is a public benefit corporation focused on creating reliable and beneficial AI systems. They are seeking a Staff Machine Learning Engineer to design and implement reinforcement learning environments for their AI system, Claude, specifically for virtual collaborator workflows.

Artificial Intelligence (AI), Foundational AI, Generative AI, Information Technology, Machine Learning

H1B Sponsored

Responsibilities

Designing and implementing reinforcement learning pipelines specifically targeted at virtual collaborator use cases (productivity, organizational navigation, vertical domains)

Building and scaling our data creation platform for generating high-quality, open-ended tasks with domain experts and crowdworkers

Integrating real organizational data to create authentic training environments

Developing robust rubric-based evaluation systems that maintain quality while avoiding reward hacking

Training Claude on advanced document manipulation, including understanding, enhancing, and co-creating

Partnering directly with product teams to ensure training aligns with shipped features

Qualifications

Python, Machine Learning, Reinforcement Learning, Data Pipelines, Evaluation Frameworks, Collaboration, Problem Solving, Communication Skills

Required

At least a Bachelor's degree in a related field or equivalent experience

Very experienced Python programmer who can quickly produce reliable, high-quality code

Strong machine learning experience

Thrive at the intersection of research and product, with a pragmatic approach to solving real-world problems

Comfortable with ambiguity and can balance research rigor with shipping deadlines

Enjoy collaborating across multiple teams (data operations, model training, product)

Can context-switch between research problems and product engineering tasks

Care about making AI genuinely helpful for everyday enterprise workflows

Preferred

Building human-in-the-loop training systems or crowdsourcing platforms

Working with enterprise tools and APIs (Google Workspace, Microsoft Office, Slack, etc.)

Developing evaluation frameworks for open-ended tasks

Domain expertise in finance, legal, or healthcare workflows

Creating scalable data pipelines with quality control mechanisms

Reward modeling and preventing reward hacking in RL systems

Translating product requirements into technical training objectives

Benefits

Equity and benefits

Optional equity donation matching

Generous vacation and parental leave

Flexible working hours

Company

Anthropic

Anthropic is an AI research company that focuses on the safety and alignment of AI systems with human values.

Founded in 2021

San Francisco, California, USA

501-1000 employees

https://www.anthropic.com

H1B Sponsorship

Anthropic has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is presented below for reference. (Data powered by the US Department of Labor)

Distribution of Different Job Fields Receiving Sponsorship (chart; the highlighted segment represents job fields similar to this job)

Trends of Total Sponsorships

2025 (105)

2024 (13)

2023 (3)

2022 (4)

2021 (1)

Funding

Current Stage

Late Stage

Total Funding

$33.74B

Key Investors

Lightspeed Venture Partners, Google, Amazon

2025-09-02 · Series F · $13B

2025-05-16 · Debt Financing · $2.5B

2025-03-03 · Series E · $3.5B

Leadership Team

Dario Amodei

CEO & Co-Founder

Daniela Amodei

President & Co-Founder

Recent News

Sherwood News

Stocks dragged down by financials and hyperscalers

2026-01-16

The Express Tribune

AI wants access to your medical records — should you say yes?

2026-01-16

PYMNTS.com

How Healthcare Innovation Starts With Regulation and Ends With Integration

2026-01-16

Company data provided by Crunchbase