Research Engineer - Post Training jobs in United States

Turing · 1 month ago

Research Engineer - Post Training

Turing is a leading research accelerator for frontier AI labs based in San Francisco. The Research Engineer role focuses on building foundational systems for generating and refining data to support advanced AI, with responsibilities including designing data pipelines and advancing reinforcement learning algorithms.

Artificial Intelligence (AI) · Generative AI · Information Technology · Machine Learning · Software Engineering
H1B Sponsor Likely

Responsibilities

Design and implement large-scale data pipelines for RL, SFT, and post-training workflows to create and refine high-quality datasets
Develop and iterate on simulated or interactive environments to train, evaluate, and stress-test reasoning and coding agents
Advance reinforcement learning algorithms and generalizable reward models to improve model reasoning, coding, and decision-making
Define and improve data quality and evaluation metrics to ensure models are learning from the best possible signals
Implement scalable model evaluation frameworks to measure progress in reasoning, code generation, and agentic capabilities
Collaborate cross-functionally with research, data, multimodal, and product teams to turn cutting-edge research into real-world impact

Qualifications

Reinforcement learning · Large language models · Data pipelines · Model evaluation metrics · Software engineering · Data-driven methods · Benchmark development · Cross-functional collaboration · Problem-solving

Required

Strong software engineering and systems-building skills
Deep understanding of machine learning and fine-tuning of large language models (LLMs)
Hands-on experience improving model behavior through data-driven fine-tuning and reinforcement learning methods (SFT, PPO, DPO, or similar)
Familiarity with large-scale data generation and evaluation pipelines
Experience developing benchmarks and evaluation metrics for reasoning, coding, or multi-agent systems
Proven ability to design or optimize models for complex challenges such as multi-modality, long-context reasoning, or multi-agent orchestration

Preferred

Experience building or scaling infrastructure for large-scale RL or post-training

Benefits

Amazing work culture (super collaborative and supportive work environment; 5 days a week)
Awesome colleagues (surround yourself with top talent from Meta, Google, LinkedIn, etc., as well as people with deep startup experience)
Competitive compensation
Flexible working hours

Company

Turing advances frontier AI and builds real-world systems for Fortune 500 companies, governments, and the world’s leading AI labs.

H1B Sponsorship

Turing has a track record of offering H1B sponsorship. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference (data from the US Department of Labor).
Trends of Total Sponsorships
2025 (16)
2024 (8)
2023 (7)
2022 (16)
2021 (6)

Funding

Current Stage: Late Stage
Total Funding: $270.19M
Key Investors: Khazanah Nasional, AltaIR Capital, WestBridge Capital
2025-03-06 · Series E · $111M
2021-12-07 · Convertible Note · $6.85M
2021-10-04 · Series D · $87M

Leadership Team

Jonathan Siddharth · Founder & CEO
Vijay Krishnan · Founder & CTO
Company data provided by Crunchbase