Research Engineer / Scientist, Alignment Science jobs in United States

Anthropic · 2 weeks ago

Research Engineer / Scientist, Alignment Science

Anthropic is a public benefit corporation dedicated to creating reliable and beneficial AI systems. The Research Engineer / Scientist on the Alignment Science team will conduct exploratory research on AI safety, focusing on risks from powerful future systems, and collaborate with teams across the company to develop techniques for ensuring the safety and alignment of AI models.

Artificial Intelligence (AI) · Foundational AI · Generative AI · Information Technology · Machine Learning
H1B Sponsored

Responsibilities

Build and run elegant and thorough machine learning experiments to help us understand and steer the behavior of powerful AI systems
Contribute to exploratory experimental research on AI safety, with a focus on risks from powerful future systems
Develop techniques to keep highly capable models helpful and honest, even as they surpass human-level intelligence in various domains
Create methods to ensure advanced AI systems remain safe and harmless in unfamiliar or adversarial scenarios
Create model organisms of misalignment to improve our empirical understanding of how alignment failures might arise
Build and align a system that can speed up & improve alignment research
Understand and document the highest-stakes and most concerning emerging properties of models through pre-deployment alignment and welfare assessments
Develop robust defenses against adversarial attacks, comprehensive evaluation frameworks for model safety, and automated systems to detect and mitigate potential risks before deployment
Investigate and address potential model welfare, moral status, and related questions
Test the robustness of our safety techniques by training language models to subvert them, and measuring how effective those subversion attempts are against our interventions
Run multi-agent reinforcement learning experiments to test out techniques like AI Debate
Build tooling to efficiently evaluate the effectiveness of novel LLM-generated jailbreaks
Write scripts and prompts to efficiently produce evaluation questions to test models’ reasoning abilities in safety-relevant contexts
Contribute ideas, figures, and writing to research papers, blog posts, and talks
Run experiments that feed into key AI safety efforts at Anthropic, like the design and implementation of our Responsible Scaling Policy

Qualifications

Machine Learning · AI Safety Research · Reinforcement Learning · Python · Research Engineering · Empirical Research · Collaboration · Communication Skills · Technical Writing · Problem Solving

Required

Have significant software, ML, or research engineering experience
Have some experience contributing to empirical AI research projects
Have some familiarity with technical AI safety research
Prefer fast-moving collaborative projects to extensive solo efforts
Pick up slack, even if it goes outside your job description
Care about the impacts of AI
Education requirements: hold at least a Bachelor's degree in a related field, or have equivalent experience

Preferred

Have experience authoring research papers in machine learning, NLP, or AI safety
Have experience with LLMs
Have experience with reinforcement learning
Have experience with Kubernetes clusters and complex shared codebases
Candidates need not have:
100% of the skills listed above to perform the job
Formal certifications or education credentials

Benefits

Equity
Incentive compensation
Optional equity donation matching
Generous vacation and parental leave
Flexible working hours

Company

Anthropic

Anthropic is an AI research company that focuses on the safety and alignment of AI systems with human values.

H1B Sponsorship

Anthropic has a track record of offering H1B sponsorship. Please note that this does not guarantee sponsorship for this specific role. The information below is provided for reference. (Data powered by the US Department of Labor)
Total sponsorships by year
2025 (105)
2024 (13)
2023 (3)
2022 (4)
2021 (1)

Funding

Current Stage
Late Stage
Total Funding
$33.74B
Key Investors
Lightspeed Venture Partners · Google · Amazon
2025-09-02 · Series F · $13B
2025-05-16 · Debt Financing · $2.5B
2025-03-03 · Series E · $3.5B

Leadership Team

Dario Amodei
CEO & Co-Founder
Daniela Amodei
President & Co-Founder
Company data provided by Crunchbase