Member of Technical Staff - Post-Training jobs in United States

Reflection AI · 21 hours ago

Member of Technical Staff - Post-Training

Reflection AI is on a mission to build open superintelligence and make it accessible to all. The role involves building systems that transform pre-trained models into aligned agents and driving research initiatives in post-training optimization.

Computer Software
H1B Sponsor Likely

Responsibilities

Build systems that transform powerful pre-trained models into aligned and general agents
Drive research and engineering initiatives that push the frontier of post-training, from data curation to large-scale optimization
Develop data generation pipelines, reward models, reinforcement learning algorithms, and inference-time scaling techniques
Collaborate across pre-training and post-training teams to deliver step-function gains in model capability
Contribute to shaping our understanding of how large models learn to reason, follow instructions, and improve through reinforcement learning

Qualifications

Machine Learning Fundamentals, Large-scale LLM Training, Reinforcement Learning Techniques, Data Generation Pipelines, Distributed Systems, Communication, Collaborative Work, Fast-paced Environment

Required

Deep understanding of machine learning fundamentals and practical experience with large-scale LLM training
Strong engineering skills, comfortable diving into complex ML codebases and distributed systems
Experience improving model behavior through data, reward modeling, or RL techniques
Evidence of owning ambitious research or engineering agendas that led to measurable model improvements
Thrive in a fast-paced, high-agency startup environment; bias toward action and clarity of execution
Able to work fluidly across research and infra boundaries
Strong communication capabilities and comfort working collaboratively
Passionate about advancing the frontier of intelligence

Benefits

Comprehensive medical, dental, vision, life, and disability insurance.
Fully paid parental leave for all new parents, including adoptive and surrogate journeys.
Financial support for family planning.
Paid time off when you need it, relocation support, and more perks that optimize your time.
Lunch and dinner are provided daily.
Regular off-sites and team celebrations.

Company

Reflection AI

Frontier open intelligence accessible to all. Our team previously built frontier LLMs at labs like DeepMind, OpenAI, and Anthropic.

H1B Sponsorship

Reflection AI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by US Department of Labor)
[Chart: Distribution of Different Job Fields Receiving Sponsorship, with the job field similar to this job highlighted]
Trends of Total Sponsorships: 2025 (5)

Funding

Current Stage: Early Stage
Company data provided by Crunchbase