Senior Data Scientist - AI Red Teaming & Model Risk jobs in United States
info-icon
This job has closed.
company-logo

Uber · 17 hours ago

Senior Data Scientist - AI Red Teaming & Model Risk

Uber is a leading company in the field of transportation and technology, seeking a Senior Data Scientist to join their AI Red Teaming efforts. The role focuses on adversarial evaluation, failure analysis, and risk discovery in AI models and agents, aiming to improve safety and robustness in AI systems.

LogisticsMobile AppsRide SharingSoftwareTransportation
check
H1B Sponsor Likelynote

Responsibilities

Design and execute AI red-teaming experiments against LLMs and AI agents to identify: prompt injection (direct & indirect), jailbreaking and policy bypass, model and tool poisoning, context and memory poisoning, behavioral drift and unsafe autonomy
Develop adversarial datasets, probes, and test harnesses to systematically evaluate model and agent behavior under attack
Define and track AI risk metrics beyond accuracy (e.g., failure rates, drift indicators, unsafe action likelihood, confidence miscalibration)
Analyze agent workflows and decision traces to understand how failures emerge across multi-step reasoning and tool use
Collaborate with security engineers and AI platform teams to translate findings into guardrails, mitigations, and design improvements
Build reusable evaluation pipelines to support continuous red teaming and regression testing as models and agents evolve

Qualification

AI red teamingLLMs experienceExperimental designPython proficiencyAdversarial evaluationModel safetyStatistical analysisComplex model behaviorSecurity interestAI evaluation tools

Required

5+ years of experience as a Data Scientist, Applied Scientist, or ML Scientist
Hands-on experience working with LLMs or generative AI systems
Direct experience with AI red teaming, model safety, or adversarial evaluation
Direct experience with prompt injection, jailbreaks, and LLM failure modes
Strong background in experimental design, evaluation, and statistical analysis
Experience analyzing complex model behavior and failure cases beyond standard metrics
Proficiency in Python and common DS/ML tooling

Preferred

Experience evaluating agentic systems, including tool use, memory, or multi-step workflows
Knowledge of GenAI architectures (transformers, embeddings, RAG, agent frameworks)
Experience building custom evaluation datasets or simulation environments
Background or strong interest in security, privacy, or trust & safety
Familiarity with AI evaluation tools (e.g., custom judges, LLM-as-judge, simulation frameworks)

Benefits

You will be eligible to participate in Uber's bonus program
May be offered an equity award & other types of comp
You will also be eligible for various benefits

Company

Uber develops, markets, and operates a ride-sharing mobile application that allows consumers to submit a trip request.

H1B Sponsorship

Uber has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (830)
2024 (796)
2023 (684)
2022 (954)
2021 (750)
2020 (638)

Funding

Current Stage
Public Company
Total Funding
$35.56B
Key Investors
William AckmanPayPalToyota Motor
2025-09-08Post Ipo Debt· $2.25B
2025-05-13Post Ipo Debt· $1B
2025-01-01Post Ipo Equity· $2.3B

Leadership Team

leader-logo
Dara Khosrowshahi
CEO
linkedin
leader-logo
Prashanth Mahendra -Rajah
Chief Financial Officer
linkedin
Company data provided by crunchbase