Anthropic AI Safety Fellow jobs in United States
cer-icon
Apply on Employer Site
company-logo

Anthropic · 4 hours ago

Anthropic AI Safety Fellow

Anthropic is dedicated to creating reliable and safe AI systems. They are seeking AI Safety Fellows to conduct empirical research on AI safety, benefiting from mentorship and access to resources while working on projects aligned with the company's research priorities.

Artificial Intelligence (AI)Foundational AIGenerative AIInformation TechnologyMachine Learning
badNo H1Bnote

Responsibilities

Direct mentorship from Anthropic researchers
Access to a shared workspace (in either Berkeley, California or London, UK)
Connection to the broader AI safety research community
Weekly stipend of 3,850 USD / 2,310 GBP / 4,300 CAD & access to benefits (benefits vary by country)
Funding for compute (~$15k/month) and other research expenses
Undergo a project selection & mentor matching process
Work on an empirical project aligned with our research priorities, with the goal of producing a public output (e.g. a paper submission)
Collaborate with mentors in select AI safety research areas such as Scalable Oversight, Adversarial Robustness, Model Internals, and AI Welfare

Qualification

PythonEmpirical ML researchLarge Language ModelsDeep learning frameworksTechnical backgroundOpen-source contributionsCollaborative environmentsCommunication skills

Required

Are motivated by reducing catastrophic risks from advanced AI systems
Are excited to transition into full-time empirical AI safety research and would be interested in a full-time role at Anthropic
Have a strong technical background in computer science, mathematics, physics, cybersecurity, or related fields
Thrive in fast-paced, collaborative environments
Can implement ideas quickly and communicate clearly
Fluent in Python programming
Available to work full-time on the Fellows program for 4 months
We require at least a Bachelor's degree in a related field or equivalent experience

Preferred

Experience with empirical ML research projects
Experience working with Large Language Models
Experience in one of the research areas mentioned above
Experience with deep learning frameworks and experiment management
Track record of open-source contributions

Benefits

Access to a shared workspace (in either Berkeley, California or London, UK)
Connection to the broader AI safety research community
Funding for compute (~$15k/month) and other research expenses
Optional equity donation matching
Generous vacation and parental leave
Flexible working hours

Company

Anthropic

twittertwittertwitter
company-logo
Anthropic is an AI research company that focuses on the safety and alignment of AI systems with human values.

Funding

Current Stage
Late Stage
Total Funding
$33.74B
Key Investors
Lightspeed Venture PartnersGoogleAmazon
2025-09-02Series F· $13B
2025-05-16Debt Financing· $2.5B
2025-03-03Series E· $3.5B

Leadership Team

leader-logo
Dario Amodei
Co-Founder and CEO
linkedin
leader-logo
Daniela Amodei
President and co-founder
linkedin
Company data provided by crunchbase