Research Engineer, Interpretability jobs in United States
cer-icon
Apply on Employer Site
company-logo

Anthropic · 1 day ago

Research Engineer, Interpretability

Anthropic is a public benefit corporation focused on creating reliable and interpretable AI systems. They are seeking a Research Engineer for their Interpretability team to help reverse-engineer how trained models work and improve model safety through mechanistic interpretability.

Artificial Intelligence (AI)Foundational AIGenerative AIInformation TechnologyMachine Learning
check
H1B Sponsorednote

Responsibilities

Implement and analyze research experiments, both quickly in toy scenarios and at scale in large models
Set up and optimize research workflows to run efficiently and reliably at large scale
Build tools and abstractions to support rapid pace of research experimentation
Develop and improve tools and infrastructure to support other teams in using Interpretability’s work to improve model safety

Qualification

PythonMachine LearningNeural NetworksPytorchLanguage ModelingCollaborationCommunication SkillsProblem Solving

Required

5-10+ years of experience building software
Highly proficient in at least one programming language (e.g., Python, Rust, Go, Java) and productive with python
Some experience contributing to empirical AI research projects
Strong ability to prioritize and direct effort toward the most impactful work and are comfortable operating with ambiguity and questioning assumptions
Prefer fast-moving collaborative projects to extensive solo efforts
Want to learn more about machine learning research and its applications and collaborate closely with researchers
Care about the societal impacts and ethics of your work
At least a Bachelor's degree in a related field or equivalent experience

Preferred

Designing a code base so that anyone can quickly code experiments, launch them, and analyze their results without hitting bugs
Optimizing the performance of large-scale distributed systems
Collaborating closely with researchers
Language modeling with transformers
GPUs or Pytorch

Benefits

Equity and benefits
Optional equity donation matching
Generous vacation and parental leave
Flexible working hours
A lovely office space in which to collaborate with colleagues

Company

Anthropic

twittertwittertwitter
company-logo
Anthropic is an AI research company that focuses on the safety and alignment of AI systems with human values.

H1B Sponsorship

Anthropic has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (105)
2024 (13)
2023 (3)
2022 (4)
2021 (1)

Funding

Current Stage
Late Stage
Total Funding
$33.74B
Key Investors
Lightspeed Venture PartnersGoogleAmazon
2025-09-02Series F· $13B
2025-05-16Debt Financing· $2.5B
2025-03-03Series E· $3.5B

Leadership Team

leader-logo
Dario Amodei
CEO & Co-Founder
linkedin
leader-logo
Daniela Amodei
President and co-founder
linkedin
Company data provided by crunchbase