Freelance Agent Evaluation Analyst jobs in United States

Mindrift · 3 days ago

Freelance Agent Evaluation Analyst

Mindrift is a company focused on leveraging collective human intelligence to enhance the future of AI. It is seeking a Freelance Agent Evaluation Analyst to review and improve evaluation tasks for AI agents, ensuring logical consistency and completeness.

Computer Software

Responsibilities

Reviewing evaluation tasks and scenarios for logic, completeness, and realism
Identifying inconsistencies, missing assumptions, or unclear decision points
Helping define clear expected behaviors (gold standards) for AI agents
Annotating cause-effect relationships, reasoning paths, and plausible alternatives
Thinking through complex systems and policies as a human would to ensure agents are tested properly
Working closely with QA, writers, or developers to suggest refinements or edge case coverage

Qualifications

Analytical thinking, Structured data formats, Attention to detail, Policy evaluation, QA, Test-case thinking, Communication skills, Experience in consulting, Exposure to LLMs, Scoring

Required

Excellent analytical thinking: Can reason about complex systems, scenarios, and logical implications
Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements
Familiarity with structured data formats: Can read, though not necessarily write, JSON/YAML
Ability to assess scenarios holistically: What's missing, what's unrealistic, what might break?
Good communication and clear writing (in English) to document your findings

Preferred

Experience with policy evaluation, logic puzzles, case studies, or structured scenario design
Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research
Exposure to LLMs, prompt engineering, or AI-generated content
Familiarity with QA or test-case thinking (edge cases, failure modes, 'what could go wrong')
Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.)

Benefits

Get paid for your expertise, with rates that can go up to $80/hour depending on your skills, experience, and project needs
Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments
Participate in an advanced AI project and gain valuable experience to enhance your portfolio
Influence how future AI models understand and communicate in your field of expertise

Company

Mindrift

Welcome to Mindrift — a space where innovation meets opportunity.

Funding

Current Stage: Late Stage
Company data provided by Crunchbase