Apply on Employer Site

Reflection AI · 1 day ago

Member of Technical Staff - Data Quality Engineer (Post-training)

United States

Full-time

Remote

Mid Level

ReflectionAI is dedicated to building open superintelligence accessible to all. The Data Quality Engineer will ensure the data used for training and evaluating models meets high standards of quality and reliability, directly impacting model performance and capabilities.

Computer Software

H1B Sponsor Likely

Responsibilities

Own upstream data quality for LLM post-training and evaluation by analyzing expert-developed datasets and operationalizing quality standards for reasoning, alignment, and agentic use cases

Partner closely with research and post-training teams to translate requirements into measurable quality signals, and provide actionable feedback to external data vendors

Design, validate, and scale automated QA methods, including LLM-as-a-Judge frameworks, to reliably measure data quality across large campaigns

Build reusable QA pipelines that reliably deliver high-quality data to post-training teams for model training and evaluation

Monitor and report on data quality over time, driving continuous iteration on quality standards, processes, and acceptance criteria

Qualification

PythonML / LLM workflowsAutomated QA methodsLarge datasetsData quality analysisAnalytical mindsetCommunicationDetail-oriented

Required

Strong engineering fundamentals with experience building data pipelines, QA systems, or evaluation workflows for post-training data and agentic environments

Detail-oriented with an analytical mindset, able to identify failure modes, inconsistencies, and subtle issues that affect data quality

Solid understanding of how data quality impacts training (SFT and RL) and evaluation, with the ability to translate quality concerns into concrete signals, decisions, and feedback

Experience designing and validating automated quality checks, including rule-based systems, statistical methods, or model-assisted approaches such as LLM-as-a-Judge

Comfortable working autonomously, owning problems end-to-end, and collaborating effectively with researchers, engineers, and operations partners

Proficiency in Python and building ML / LLM workflows. Must be comfortable debugging and writing scalable code

Experience working with large datasets and automated evaluation or quality-checking systems

Familiarity with how LLMs work and can describe how models are trained and evaluated

Excellent communication skills with the ability to clearly articulate complex technical concepts across teams

Benefits

Comprehensive medical, dental, vision, life, and disability insurance.

Fully paid parental leave for all new parents, including adoptive and surrogate journeys.

Financial support for family planning.

Paid time off when you need it, relocation support, and more perks that optimize your time.

Lunch and dinner are provided daily.

Regular off-sites and team celebrations.

Company

Reflection AI

Frontier open intelligence accessible to all. Our team previously built frontier LLMs at labs like DeepMind, OpenAI, and Anthropic.

San Francisco, California, US

11-50 employees

https://www.reflection.ai/

H1B Sponsorship

Reflection AI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)

Distribution of Different Job Fields Receiving Sponsorship

Represents job field similar to this job

Trends of Total Sponsorships

2025 (5)

Funding

Current Stage

Early Stage

Company data provided by crunchbase