Research, Post-Training jobs in United States
cer-icon
Apply on Employer Site
company-logo

Thinking Machines Lab · 1 month ago

Research, Post-Training

Thinking Machines Lab's mission is to empower humanity through advancing collaborative general intelligence. The role of post-training researchers sits at the core of our roadmap, blending fundamental research and practical engineering to create AI systems that are useful, safe, and collaborative for humans.

Artificial Intelligence (AI)Foundational AIGenerative AIInformation TechnologyProduct ResearchSoftware
check
H1B Sponsorednote

Responsibilities

Develop and tune the recipe: iterate on post-training recipes, consisting of a collection of datasets, training stages, and hyperparameters. Measure how recipe choices affect various metrics
Iterate on evals: post-training involves a never-ending loop of defining a set of evaluations, optimizing them, and then realizing your existing evals don’t capture what matters. You’ll be responsible for both making numbers go up, and making sure the numbers are meaningful
Debug and understand: while tuning the details of a training configuration, we often observe results that don’t quite make sense. You’ll be responsible for both getting things to work, and developing a deeper understanding, which we can bring to the next problem
Scale and explore: post-training will involve a combination of scaling the existing methodologies and developing new ones. We’ll want to both measure how performance metrics scale with dataset size, and explore using a completely different kind of training dataset
Publish and present research that moves the entire community forward. Share code, datasets, and insights that accelerate progress across industry and academia

Qualification

PythonDeep learning frameworksProbabilityStatisticsMachine Learning fundamentalsResearch contributionsDebugging distributed trainingHuman data collectionClarity in communication

Required

Proficiency in Python and familiarity with at least one deep learning framework (e.g., PyTorch, TensorFlow, or JAX). Comfortable with debugging distributed training and writing code that scales
Bachelor's degree or equivalent experience in Computer Science, Machine Learning, Physics, Mathematics, or a related discipline with strong theoretical and empirical grounding
Clarity in communication, an ability to explain complex technical concepts in writing

Preferred

A strong grasp of probability, statistics, and ML fundamentals. You can look at experimental data and distinguish between real effects, noise, and bugs
Prior experience with RLHF, RLAIF, preference modeling, or reward learning for large models
Experience managing or analyzing human data collection campaigns or large-scale annotation workflows
Research or engineering contributions in alignment, data-centric AI, or human-AI collaboration
PhD in Computer Science, Machine Learning, Physics, Mathematics, or a related discipline with strong theoretical and empirical grounding; or, equivalent industry research experience

Benefits

Generous health, dental, and vision benefits
Unlimited PTO
Paid parental leave
Relocation support as needed

Company

Thinking Machines Lab

twittertwittertwitter
company-logo
Thinking Machines Lab is an AI research and product company that aims to increase understanding and customization of AI systems.

H1B Sponsorship

Thinking Machines Lab has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (9)

Funding

Current Stage
Early Stage
Total Funding
$2.01B
Key Investors
Andreessen HorowitzMinistry of Economy, Culture and Innovation
2025-06-20Seed· $2B
2025-05-05Grant· $9.98M

Leadership Team

leader-logo
Mira Murati
Co-Founder and Chief Executive Officer
linkedin

Recent News

Business Insider
Company data provided by crunchbase