Adversarial AI Testing (Advanced); English & Chinese jobs in United States
cer-icon
Apply on Employer Site
company-logo

Great Value Hiring ยท 1 day ago

Adversarial AI Testing (Advanced); English & Chinese

Great Value Hiring is seeking human data experts who probe AI models with adversarial inputs to surface vulnerabilities and generate data that makes AI safer for customers. The role involves red teaming conversational AI models, generating high-quality human data, and documenting findings for actionable insights.

Staffing & Recruiting

Responsibilities

Red team conversational AI models and agents: jailbreaks, prompt injections, misuse cases, bias exploitation, multi-turn manipulation
Generate high-quality human data: annotate failures, classify vulnerabilities, and flag systemic risks
Apply structure: follow taxonomies, benchmarks, and playbooks to keep testing consistent
Document reproducibly: produce reports, datasets, and attack cases customers can act on

Qualification

Adversarial MLCybersecuritySocio-technical riskFluency in EnglishChineseCreative probing

Required

Strong fluency in English and Chinese
Prior red teaming experience (AI adversarial work, cybersecurity, socio-technical probing)
Curiosity and adversarial mindset: instinctively push systems to breaking points
Structured approach: use frameworks or benchmarks, not just random hacks
Strong communication skills: explain risks clearly to technical and non-technical stakeholders
Adaptability: thrive on moving across projects and customers
Adversarial ML skills: jailbreak datasets, prompt injection, RLHF/DPO attacks, model extraction
Cybersecurity skills: penetration testing, exploit development, reverse engineering
Socio-technical risk skills: harassment/disinfo probing, abuse analysis, conversational AI testing
Creative probing skills: psychology, acting, writing for unconventional adversarial thinking

Company

Great Value Hiring

twitter
company-logo
We started "Great Value Hiring" with a simple idea: to make meaningful connections.

Funding

Current Stage
Early Stage
Company data provided by crunchbase