Anthropic · 2 weeks ago

[Expression of Interest] Research Scientist/Engineer, Alignment Finetuning

San Francisco, CA

Full-time

Hybrid

Mid, Senior Level

$315K/yr - $340K/yr

Anthropic is a public benefit corporation dedicated to creating reliable and beneficial AI systems. As a Research Scientist/Engineer on the Alignment Finetuning team, you will lead the development of techniques to train language models that align better with human values, focusing on moral reasoning and improved honesty.

Artificial Intelligence (AI), Foundational AI, Generative AI, Information Technology, Machine Learning

H1B Sponsored

Responsibilities

Develop and implement novel finetuning techniques using synthetic data generation and advanced training pipelines

Use these techniques to train models with better alignment properties, including honesty, character, and harmlessness

Create and maintain evaluation frameworks to measure alignment properties in models

Collaborate across teams to integrate alignment improvements into production models

Develop processes to help automate and scale the work of the team

Qualifications

Python, ML model training, ML research implementation, Analytical skills, ML metrics, Language model finetuning, Synthetic data generation, Collaboration skills, Problem-solving skills

Required

Have an MS/PhD in Computer Science, ML, or related field, or equivalent experience

Possess strong programming skills, especially in Python

Have experience with ML model training and experimentation

Have a track record of implementing ML research

Demonstrate strong analytical skills for interpreting experimental results

Have experience with ML metrics and evaluation frameworks

Excel at turning research ideas into working code

Identify and resolve practical implementation challenges

Education requirements: We require at least a Bachelor's degree in a related field or equivalent experience

Preferred

Experience with language model finetuning

Background in AI alignment research

Published work in ML or alignment

Experience with synthetic data generation

Familiarity with techniques like RLHF, constitutional AI, and reward modeling

Track record of designing and implementing novel training approaches

Experience with model behavior evaluation and improvement

Benefits

Equity

Benefits

Incentive compensation

Optional equity donation matching

Generous vacation and parental leave

Flexible working hours

Company

Anthropic

Anthropic is an AI research company that focuses on the safety and alignment of AI systems with human values.

Founded in 2021

San Francisco, California, USA

501-1000 employees

https://www.anthropic.com

H1B Sponsorship

Anthropic has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is presented below for your reference. (Data powered by the US Department of Labor)

Trends of Total Sponsorships

2025 (105)

2024 (13)

2023 (3)

2022 (4)

2021 (1)

Funding

Current Stage

Late Stage

Total Funding

$33.74B

Key Investors

Lightspeed Venture Partners, Google, Amazon

2025-09-02 · Series F · $13B

2025-05-16 · Debt Financing · $2.5B

2025-03-03 · Series E · $3.5B

Leadership Team

Dario Amodei

CEO & Co-Founder

Daniela Amodei

President & Co-Founder

Recent News

PitchBook

Surge in mega-deals vaults US to top of global VC per capita list

2026-01-11

CIO

Insurance giant Allianz signs Claude Code deal with Anthropic

2026-01-11

VentureBeat

Anthropic cracks down on unauthorized Claude usage by third-party harnesses and rivals

2026-01-11

Company data provided by Crunchbase