SME Careers · 5 hours ago
Hebrew Trust & Safety Data Trainer
SME Careers is a fast-growing AI Data Services company and a subsidiary of SuperAnnotate that provides AI training data for many of the world’s largest AI companies. The role involves reviewing AI-generated responses and generating safety-focused evaluation content, assessing reasoning quality, and providing expert feedback to ensure outputs are accurate and safe.
Responsibilities
Curate and label safety-focused training examples (including adversarial/red-team cases) in English and Hebrew that probe model behavior across hate/harassment, sexual content, self-harm, violence, bias, illegal services, malicious activity, malicious code, and misinformation—capturing nuance and intent with Minimum C1 English and near-native French proficiency
Review, score, and compare multiple model responses against safety policy and quality rubrics, documenting why an output is safe/unsafe and identifying failure modes such as evasion, normalization, escalation, or procedural enablement
Continuously stress-test and audit model behavior for policy gaps and edge cases; flag ambiguous scenarios, propose clearer decision rules, and help maintain consistent annotation standards across reviewers
Qualification
Required
Bachelor's degree or higher in a relevant field (e.g., Communications, Linguistics, Psychology, Law/Policy, Security Studies) or equivalent professional experience
Near-native or native Hebrew proficiency (reading/writing) for high-precision safety labeling and cultural-linguistic nuance
Minimum C1 English proficiency (reading/writing) for policy interpretation, prompt understanding, and consistent documentation
Experience in Trust & Safety, content moderation, policy enforcement, risk operations, investigations, or safety evaluation work
LLM red teaming experience is a must (proven ability to probe safety boundaries and document adversarial patterns)
Strong knowledge of safety domains: Hate & Harassment, Sexual content, Suicide & Self-Harm, Violence, Bias, Illegal goods/services, malicious activities, malicious code, and deliberate misinformation
Emotional resilience: an understanding that this role requires annotating texts that contain unsafe, explicit, and/or toxic content, including content of a sexual, violent, or psychologically disturbing nature
Excellent judgment under ambiguity, with the ability to apply written policies consistently and explain decisions succinctly
Comfort working as an hourly contractor: dependable throughput, clear documentation, and responsiveness across time zones
Preferred
Previous experience with AI data training / annotation / evaluation is preferred
Strong hands-on experience using tools like Perplexity, Gemini, ChatGPT and others
Company
SME Careers
SME Careers by SuperAnnotate connects subject-matter experts, students, and professionals with flexible, remote AI training work such as annotation, evaluation, fact-checking, and content review.
Funding
Current Stage
Early StageCompany data provided by crunchbase