MyRemoteTeam Inc · 2 months ago
AI Prompt Engineer – HLE (Humanity’s Last Exam
MyRemoteTeam Inc is building elite teams to evaluate and advance the next generation of GenAI models. The AI Prompt Engineer will review and refine prompts, analyze model responses, and collaborate with researchers to create evaluation frameworks, ensuring high-quality outputs and human reasoning alignment.
Responsibilities
Review, analyze, and refine prompts used to train cutting-edge GenAI systems
Decide which model responses are correct, incorrect, higher quality, or more aligned with human reasoning
Work with GenAI researchers & engineers to understand evaluation goals and data workflows
Convert high-level requirements into clear, detailed evaluation frameworks
Create research-backed, high-difficulty questions that challenge AI models ("Humanity’s Last Exam" style)
Conduct deep research using large language models and trusted sources
Innovate new prompt strategies and evaluation rubrics
Ensure factual accuracy, recall, precision, and bias-free content
Maintain extremely high quality while working with minimal supervision
Qualification
Required
Review, analyze, and refine prompts used to train cutting-edge GenAI systems
Decide which model responses are correct, incorrect, higher quality, or more aligned with human reasoning
Work with GenAI researchers & engineers to understand evaluation goals and data workflows
Convert high-level requirements into clear, detailed evaluation frameworks
Create research-backed, high-difficulty questions that challenge AI models ('Humanity's Last Exam' style)
Conduct deep research using large language models and trusted sources
Innovate new prompt strategies and evaluation rubrics
Ensure factual accuracy, recall, precision, and bias-free content
Maintain extremely high quality while working with minimal supervision
Strong writing and content development experience
Ability to design complex, original problems across your domain
Deep research and fact-checking ability
A unique human voice — no AI-style mimicry
Precision, logic, and top-tier attention to detail
Strong prompt engineering skills (few-shot, chain-of-thought, reasoning prompts, etc.)
Familiarity with LLM behavior, limitations, and error patterns
Experience using AI tools for research and refinement
Ethical understanding of bias mitigation and responsible AI usage
Solid data literacy for LLM training workflows (cleaning, verification, preparation)
Preferred
Master's with distinction or PhD preferred
Fields: Finance, Engineering, Physics, Chemistry, Medicine
Bachelor's degree accepted with proven advanced experience
Graduated from leading US or international institutions (preferred)