Apply on Employer Site

Turing · 1 month ago

Research Engineer - Post Training

Palo Alto, California, United States

Full-time

Onsite

Mid, Senior Level

$170K/yr - $220K/yr

Turing is a leading research accelerator for frontier AI labs based in San Francisco. The Research Engineer role focuses on building foundational systems for generating and refining data to support advanced AI, with responsibilities including designing data pipelines and advancing reinforcement learning algorithms.

Artificial Intelligence (AI)Generative AIInformation TechnologyMachine LearningSoftware Engineering

H1B Sponsor Likely

Responsibilities

Design and implement large-scale data pipelines for RL, SFT, and post-training workflows to create and refine high-quality datasets

Develop and iterate on simulated or interactive environments to train, evaluate, and stress-test reasoning and coding agents

Advance reinforcement learning algorithms and generalizable reward models to improve model reasoning, coding, and decision-making

Define and improve data quality and evaluation metrics to ensure models are learning from the best possible signals

Implement scalable model evaluation frameworks to measure progress in reasoning, code generation, and agentic capabilities

Collaborate cross-functionally with research, data, multimodal, and product teams to bring cutting-edge research into real-world impact

Qualification

Reinforcement learningLarge language modelsData pipelinesModel evaluation metricsSoftware engineeringData-driven methodsBenchmark developmentCross-functional collaborationProblem-solving

Required

Strong software engineering and systems-building skills

Deep understanding of machine learning and fine-tuning of large language models (LLMs)

Hands-on experience improving model behavior through data-driven methods and reinforcement learning (SFT, PPO, DPO, or similar)

Familiarity with large-scale data generation and evaluation pipelines

Experience developing benchmarks and evaluation metrics for reasoning, coding, or multi-agent systems

Proven ability to design or optimize models for complex challenges such as multi-modality, long-context reasoning, or multi-agent orchestration

Preferred

Experience building or scaling infrastructure for large-scale RL or post-training

Benefits

Amazing work culture (Super collaborative & supportive work environment; 5 days a week)

Awesome colleagues (Surround yourself with top talent from Meta, Google, LinkedIn etc. as well as people with deep startup experience)

Competitive compensation

Flexible working hours

Company

Turing

Glassdoor3.9

Turing advances frontier AI and builds real-world systems for Fortune 500 companies, governments, and the world’s leading AI labs.

Founded in 2018

Palo Alto, California, USA

1001-5000 employees

https://www.turing.com

H1B Sponsorship

Turing has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)

Distribution of Different Job Fields Receiving Sponsorship

Represents job field similar to this job

Trends of Total Sponsorships

2025 (16)

2024 (8)

2023 (7)

2022 (16)

2021 (6)

Funding

Current Stage

Late Stage

Total Funding

$270.19M

Key Investors

Khazanah NasionalAltaIR CapitalWestBridge Capital

2025-03-06Series E· $111M

2021-12-07Convertible Note· $6.85M

2021-10-04Series D· $87M

Leadership Team

Jonathan Siddharth

Founder & CEO

Vijay Krishnan

Founder & CTO

Recent News

Foundation Capital

Foundation Capital Portfolio

2025-12-31

BRIDGE

Turing, Developer of Fully Autonomous Driving Systems, Raises ¥15.27 Billion in Series A

2025-11-23

EIN Presswire

Former Google VP Catherine Lacavera Joins Toborlife AI Board of Directors

2025-11-22

Company data provided by crunchbase