Member of Technical Staff - Alignment Lead jobs in United States
cer-icon
Apply on Employer Site
company-logo

Reflection AI ยท 15 hours ago

Member of Technical Staff - Alignment Lead

Reflection AI is focused on building open superintelligence and making it accessible to all. The Alignment Lead will drive the alignment stack and lead research efforts to improve model performance and stability.

Computer Software
check
H1B Sponsor Likelynote

Responsibilities

Drive the entire alignment stack, spanning instruction tuning, RLHF, and RLAIF, to push the model toward high factual accuracy and robust instruction following
Lead research efforts to design next-generation reward models and optimization objectives that significantly improve human preference (HP) performance
Curate high-quality training data and design synthetic data pipelines that solve complex reasoning and behavioral gaps
Optimize large-scale RL pipelines for stability and efficiency, ensuring rapid iteration cycles for model improvements
Collaborate closely with pre-training and evaluation teams to create tight feedback loops that translate alignment research into generalizable model gains

Qualification

Alignment methodologiesMachine LearningLarge-scale RL pipelinesComplex ML codebasesReward modelingResearch ownershipPassion for intelligenceFast-paced environment

Required

Graduate degree (MS or PhD) in Computer Science, Machine Learning, or related discipline
Deep technical command of alignment methodologies (PPO, DPO, rejection sampling) and experience scaling them to large models
Strong engineering skills, comfortable diving into complex ML codebases and distributed systems
Experience improving model behavior through data, reward modeling, or RL techniques
Evidence of owning ambitious research or engineering agendas that led to measurable model improvements
Thrive in a fast-paced, high-agency startup environment with bias toward action
Passionate about advancing the frontier of intelligence

Benefits

Comprehensive medical, dental, vision, life, and disability insurance.
Fully paid parental leave for all new parents, including adoptive and surrogate journeys.
Financial support for family planning.
Paid time off when you need it, relocation support, and more perks that optimize your time.
Lunch and dinner are provided daily.
Regular off-sites and team celebrations.

Company

Reflection AI

twitter
company-logo
Frontier open intelligence accessible to all. Our team previously built frontier LLMs at labs like DeepMind, OpenAI, and Anthropic.

H1B Sponsorship

Reflection AI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (5)

Funding

Current Stage
Early Stage
Company data provided by crunchbase