Crossing Hurdles · 5 hours ago
AI Evaluation (Safety Expert) | $90/hr Remote
Crossing Hurdles is seeking an AI Evaluation – Safety Specialist to evaluate AI-generated text against safety criteria. The role involves annotating and documenting evaluations while collaborating with research and safety teams to enhance AI safety research and model improvements.
Staffing & Recruiting
Responsibilities
Annotate and evaluate AI-generated text against safety criteria such as bias, misinformation, unsafe reasoning, and disallowed content
Apply harm taxonomies and evolving safety guidelines consistently, including in ambiguous scenarios
Document clear reasoning to improve evaluation frameworks and internal guidelines
Identify subtle unsafe behaviors, biases, or inconsistencies that automated systems may miss
Contribute high-quality human data that supports AI safety research, model improvements, and risk audits
Collaborate asynchronously with research and safety teams in a fast-moving environment
Qualification
Required
Experience in model evaluation, structured annotation, applied research, or related analytical roles
Strong ability to detect bias, edge cases, and nuanced safety risks in AI outputs
Clear written communication and the ability to explain and defend evaluation decisions
Comfort working in experimental environments where methods evolve rapidly
High attention to detail and consistency across large volumes of text-based tasks
Ability to work independently in a remote, project-based setup
Company
Crossing Hurdles
At Crossing Hurdles, we specialise in customised recruitment and staffing solutions designed to drive success for businesses and professionals.
Funding
Current Stage
Early StageCompany data provided by crunchbase