AI Agent Evaluation Analyst for Autonomous Agents (No coding required) jobs in United States

OpenTrain AI · 10 hours ago

AI Agent Evaluation Analyst for Autonomous Agents (No coding required)

OpenTrain AI is hiring detail-oriented, analytical contributors to help test and improve autonomous AI agent evaluations. The role involves reviewing evaluation tasks, identifying inconsistencies, and collaborating with teams to ensure thorough testing of agents.

Artificial Intelligence (AI), Freelance, Marketplace, Online Portals
Hiring Manager
Akita Sanders

Responsibilities

Review and refine agent evaluation tasks and scenarios for logic, completeness, and realism
Identify inconsistencies, ambiguities, and missing assumptions
Define gold-standard expected behaviors for agents
Annotate reasoning paths, cause-effect relationships, and plausible alternatives
Collaborate with QA, writers, and developers to suggest refinements and expand edge case coverage
Ensure autonomous agents are tested thoroughly and realistically

Qualifications

Analytical thinking, Fluent written English, Attention to detail, Reading JSON, Reading YAML, QA/test-case thinking, Logic puzzles, Evaluation frameworks

Required

Strong analytical thinking and excellent attention to detail
Fluent written English with clear documentation skills
Comfort reading structured formats such as JSON or YAML (no need to write code)
Ability to reason about complex systems and spot what could break or be misinterpreted

Preferred

Prior exposure to QA/test-case thinking, logic puzzles, or evaluation frameworks

Company

OpenTrain AI

OpenTrain AI connects companies with vetted data labeling experts, supports any annotation tool, and manages escrow payments.

Funding

Current Stage
Early Stage

Leadership Team

Weston Dotson
Founder
Company data provided by Crunchbase