Anthropic · 4 hours ago
Anthropic AI Safety Fellow
Anthropic is dedicated to creating reliable and safe AI systems. They are seeking AI Safety Fellows to conduct empirical research on AI safety, benefiting from mentorship and access to resources while working on projects aligned with the company's research priorities.
Artificial Intelligence (AI)Foundational AIGenerative AIInformation TechnologyMachine Learning
Responsibilities
Direct mentorship from Anthropic researchers
Access to a shared workspace (in either Berkeley, California or London, UK)
Connection to the broader AI safety research community
Weekly stipend of 3,850 USD / 2,310 GBP / 4,300 CAD & access to benefits (benefits vary by country)
Funding for compute (~$15k/month) and other research expenses
Undergo a project selection & mentor matching process
Work on an empirical project aligned with our research priorities, with the goal of producing a public output (e.g. a paper submission)
Collaborate with mentors in select AI safety research areas such as Scalable Oversight, Adversarial Robustness, Model Internals, and AI Welfare
Qualification
Required
Are motivated by reducing catastrophic risks from advanced AI systems
Are excited to transition into full-time empirical AI safety research and would be interested in a full-time role at Anthropic
Have a strong technical background in computer science, mathematics, physics, cybersecurity, or related fields
Thrive in fast-paced, collaborative environments
Can implement ideas quickly and communicate clearly
Fluent in Python programming
Available to work full-time on the Fellows program for 4 months
We require at least a Bachelor's degree in a related field or equivalent experience
Preferred
Experience with empirical ML research projects
Experience working with Large Language Models
Experience in one of the research areas mentioned above
Experience with deep learning frameworks and experiment management
Track record of open-source contributions
Benefits
Access to a shared workspace (in either Berkeley, California or London, UK)
Connection to the broader AI safety research community
Funding for compute (~$15k/month) and other research expenses
Optional equity donation matching
Generous vacation and parental leave
Flexible working hours
Company
Anthropic
Anthropic is an AI research company that focuses on the safety and alignment of AI systems with human values.
Funding
Current Stage
Late StageTotal Funding
$33.74BKey Investors
Lightspeed Venture PartnersGoogleAmazon
2025-09-02Series F· $13B
2025-05-16Debt Financing· $2.5B
2025-03-03Series E· $3.5B
Recent News
Straits Times
2026-01-23
2026-01-23
Company data provided by crunchbase