Apply on Employer Site

Anthropic · 4 hours ago

Anthropic AI Safety Fellow

San Francisco Bay Area

Full-time

Hybrid

New Grad

$3,850/wk - $3,850/wk

Anthropic is dedicated to creating reliable and safe AI systems. They are seeking AI Safety Fellows to conduct empirical research on AI safety, benefiting from mentorship and access to resources while working on projects aligned with the company's research priorities.

Artificial Intelligence (AI)Foundational AIGenerative AIInformation TechnologyMachine Learning

No H1B

Responsibilities

Direct mentorship from Anthropic researchers

Access to a shared workspace (in either Berkeley, California or London, UK)

Connection to the broader AI safety research community

Weekly stipend of 3,850 USD / 2,310 GBP / 4,300 CAD & access to benefits (benefits vary by country)

Funding for compute (~$15k/month) and other research expenses

Undergo a project selection & mentor matching process

Work on an empirical project aligned with our research priorities, with the goal of producing a public output (e.g. a paper submission)

Collaborate with mentors in select AI safety research areas such as Scalable Oversight, Adversarial Robustness, Model Internals, and AI Welfare

Qualification

PythonEmpirical ML researchLarge Language ModelsDeep learning frameworksTechnical backgroundOpen-source contributionsCollaborative environmentsCommunication skills

Required

Are motivated by reducing catastrophic risks from advanced AI systems

Are excited to transition into full-time empirical AI safety research and would be interested in a full-time role at Anthropic

Have a strong technical background in computer science, mathematics, physics, cybersecurity, or related fields

Thrive in fast-paced, collaborative environments

Can implement ideas quickly and communicate clearly

Fluent in Python programming

Available to work full-time on the Fellows program for 4 months

We require at least a Bachelor's degree in a related field or equivalent experience

Preferred

Experience with empirical ML research projects

Experience working with Large Language Models

Experience in one of the research areas mentioned above

Experience with deep learning frameworks and experiment management

Track record of open-source contributions

Benefits

Access to a shared workspace (in either Berkeley, California or London, UK)

Connection to the broader AI safety research community

Funding for compute (~$15k/month) and other research expenses

Optional equity donation matching

Generous vacation and parental leave

Flexible working hours

Company

Anthropic

Anthropic is an AI research company that focuses on the safety and alignment of AI systems with human values.

Founded in 2021

San Francisco, California, USA

501-1000 employees

https://www.anthropic.com

Funding

Current Stage

Late Stage

Total Funding

$33.74B

Key Investors

Lightspeed Venture PartnersGoogleAmazon

2025-09-02Series F· $13B

2025-05-16Debt Financing· $2.5B

2025-03-03Series E· $3.5B

Leadership Team

Dario Amodei

Co-Founder and CEO

Daniela Amodei

President and co-founder

Recent News

Straits Times

Crooks are using AI to up their game in cyber crimes

2026-01-23

WIRED

How Claude Code Is Reshaping Software—and Anthropic

2026-01-23

PR Newswire

Wonderful Launches Agent Builder, Enabling Autonomous Agent Creation for the Enterprise

2026-01-23

Company data provided by crunchbase