Anthropic · 1 day ago

Research Engineer, Interpretability

San Francisco, CA

Full-time

Hybrid

Senior Level

$315K/yr - $560K/yr

5+ years exp

Anthropic is a public benefit corporation focused on creating reliable and interpretable AI systems. They are seeking a Research Engineer for their Interpretability team to help reverse-engineer how trained models work and improve model safety through mechanistic interpretability.

Artificial Intelligence (AI), Foundational AI, Generative AI, Information Technology, Machine Learning

H1B Sponsored

Responsibilities

Implement and analyze research experiments, both quickly in toy scenarios and at scale in large models

Set up and optimize research workflows to run efficiently and reliably at large scale

Build tools and abstractions to support a rapid pace of research experimentation

Develop and improve tools and infrastructure to support other teams in using Interpretability’s work to improve model safety

Qualifications

Python, Machine Learning, Neural Networks, PyTorch, Language Modeling, Collaboration, Communication Skills, Problem Solving

Required

5-10+ years of experience building software

Highly proficient in at least one programming language (e.g., Python, Rust, Go, Java) and productive with Python

Some experience contributing to empirical AI research projects

Strong ability to prioritize and direct effort toward the most impactful work, and comfortable operating with ambiguity and questioning assumptions

Prefer fast-moving collaborative projects to extensive solo efforts

Want to learn more about machine learning research and its applications and collaborate closely with researchers

Care about the societal impacts and ethics of your work

At least a Bachelor's degree in a related field or equivalent experience

Preferred

Designing a code base so that anyone can quickly code experiments, launch them, and analyze their results without hitting bugs

Optimizing the performance of large-scale distributed systems

Collaborating closely with researchers

Language modeling with transformers

GPUs or PyTorch

Benefits

Equity and benefits

Optional equity donation matching

Generous vacation and parental leave

Flexible working hours

A lovely office space in which to collaborate with colleagues

Company

Anthropic

Anthropic is an AI research company that focuses on the safety and alignment of AI systems with human values.

Founded in 2021

San Francisco, California, USA

501-1000 employees

https://www.anthropic.com

H1B Sponsorship

Anthropic has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is presented below for your reference. (Data Powered by US Department of Labor)

Distribution of Different Job Fields Receiving Sponsorship (chart not shown)

Trends of Total Sponsorships

2025 (105)

2024 (13)

2023 (3)

2022 (4)

2021 (1)

Funding

Current Stage

Late Stage

Total Funding

$33.74B

Key Investors

Lightspeed Venture Partners, Google, Amazon

2025-09-02 · Series F · $13B

2025-05-16 · Debt Financing · $2.5B

2025-03-03 · Series E · $3.5B

Leadership Team

Dario Amodei

CEO & Co-Founder

Daniela Amodei

President & Co-Founder

Recent News

Geeky Gadgets

Meet Claude Cowork, Your On-Device AI Helper & Browser Automation System

2026-01-17

Geeky Gadgets

Claude Code Gets a 10x Speed Boost with New Tool Search

2026-01-17

Sherwood News

How Claude Code “is the ChatGPT moment repeated” — and why that’s awful news for software stocks

2026-01-17

Company data provided by Crunchbase