Anthropic · 1 day ago
Research Engineer, Interpretability
Anthropic is a public benefit corporation focused on creating reliable and interpretable AI systems. They are seeking a Research Engineer for their Interpretability team to help reverse-engineer how trained models work and improve model safety through mechanistic interpretability.
Artificial Intelligence (AI)Foundational AIGenerative AIInformation TechnologyMachine Learning
Responsibilities
Implement and analyze research experiments, both quickly in toy scenarios and at scale in large models
Set up and optimize research workflows to run efficiently and reliably at large scale
Build tools and abstractions to support rapid pace of research experimentation
Develop and improve tools and infrastructure to support other teams in using Interpretability’s work to improve model safety
Qualification
Required
5-10+ years of experience building software
Highly proficient in at least one programming language (e.g., Python, Rust, Go, Java) and productive with python
Some experience contributing to empirical AI research projects
Strong ability to prioritize and direct effort toward the most impactful work and are comfortable operating with ambiguity and questioning assumptions
Prefer fast-moving collaborative projects to extensive solo efforts
Want to learn more about machine learning research and its applications and collaborate closely with researchers
Care about the societal impacts and ethics of your work
At least a Bachelor's degree in a related field or equivalent experience
Preferred
Designing a code base so that anyone can quickly code experiments, launch them, and analyze their results without hitting bugs
Optimizing the performance of large-scale distributed systems
Collaborating closely with researchers
Language modeling with transformers
GPUs or Pytorch
Benefits
Equity and benefits
Optional equity donation matching
Generous vacation and parental leave
Flexible working hours
A lovely office space in which to collaborate with colleagues
Company
Anthropic
Anthropic is an AI research company that focuses on the safety and alignment of AI systems with human values.
H1B Sponsorship
Anthropic has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (105)
2024 (13)
2023 (3)
2022 (4)
2021 (1)
Funding
Current Stage
Late StageTotal Funding
$33.74BKey Investors
Lightspeed Venture PartnersGoogleAmazon
2025-09-02Series F· $13B
2025-05-16Debt Financing· $2.5B
2025-03-03Series E· $3.5B
Recent News
2026-01-17
2026-01-17
Company data provided by crunchbase