Apply on Employer Site

Reflection AI · 16 hours ago

Member of Technical Staff - Alignment Lead

New York, NY

Full-time

Onsite

Senior Level

Reflection AI is focused on building open superintelligence and making it accessible to all. The Alignment Lead will drive the alignment stack and lead research efforts to improve model performance and stability.

Computer Software

H1B Sponsor Likely

Responsibilities

Drive the entire alignment stack, spanning instruction tuning, RLHF, and RLAIF, to push the model toward high factual accuracy and robust instruction following

Lead research efforts to design next-generation reward models and optimization objectives that significantly improve human preference (HP) performance

Curate high-quality training data and design synthetic data pipelines that solve complex reasoning and behavioral gaps

Optimize large-scale RL pipelines for stability and efficiency, ensuring rapid iteration cycles for model improvements

Collaborate closely with pre-training and evaluation teams to create tight feedback loops that translate alignment research into generalizable model gains

Qualification

Alignment methodologiesMachine LearningLarge-scale RL pipelinesComplex ML codebasesReward modelingResearch ownershipPassion for intelligenceFast-paced environment

Required

Graduate degree (MS or PhD) in Computer Science, Machine Learning, or related discipline

Deep technical command of alignment methodologies (PPO, DPO, rejection sampling) and experience scaling them to large models

Strong engineering skills, comfortable diving into complex ML codebases and distributed systems

Experience improving model behavior through data, reward modeling, or RL techniques

Evidence of owning ambitious research or engineering agendas that led to measurable model improvements

Thrive in a fast-paced, high-agency startup environment with bias toward action

Passionate about advancing the frontier of intelligence

Benefits

Comprehensive medical, dental, vision, life, and disability insurance.

Fully paid parental leave for all new parents, including adoptive and surrogate journeys.

Financial support for family planning.

Paid time off when you need it, relocation support, and more perks that optimize your time.

Lunch and dinner are provided daily.

Regular off-sites and team celebrations.

Company

Reflection AI

Frontier open intelligence accessible to all. Our team previously built frontier LLMs at labs like DeepMind, OpenAI, and Anthropic.

New York, NY, US

11-50 employees

https://www.reflection.ai/

H1B Sponsorship

Reflection AI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)

Distribution of Different Job Fields Receiving Sponsorship

Represents job field similar to this job

Trends of Total Sponsorships

2025 (5)

Funding

Current Stage

Early Stage

Company data provided by crunchbase