This job has closed.

Uber · 17 hours ago

Senior Data Scientist - AI Red Teaming & Model Risk

New York, NY

Full-time

Onsite

Senior Level

$171K/yr - $190K/yr

5+ years exp

Uber is a leading company in the field of transportation and technology, seeking a Senior Data Scientist to join their AI Red Teaming efforts. The role focuses on adversarial evaluation, failure analysis, and risk discovery in AI models and agents, aiming to improve safety and robustness in AI systems.

LogisticsMobile AppsRide SharingSoftwareTransportation

H1B Sponsor Likely

Responsibilities

Design and execute AI red-teaming experiments against LLMs and AI agents to identify: prompt injection (direct & indirect), jailbreaking and policy bypass, model and tool poisoning, context and memory poisoning, behavioral drift and unsafe autonomy

Develop adversarial datasets, probes, and test harnesses to systematically evaluate model and agent behavior under attack

Define and track AI risk metrics beyond accuracy (e.g., failure rates, drift indicators, unsafe action likelihood, confidence miscalibration)

Analyze agent workflows and decision traces to understand how failures emerge across multi-step reasoning and tool use

Collaborate with security engineers and AI platform teams to translate findings into guardrails, mitigations, and design improvements

Build reusable evaluation pipelines to support continuous red teaming and regression testing as models and agents evolve

Qualification

AI red teamingLLMs experienceExperimental designPython proficiencyAdversarial evaluationModel safetyStatistical analysisComplex model behaviorSecurity interestAI evaluation tools

Required

5+ years of experience as a Data Scientist, Applied Scientist, or ML Scientist

Hands-on experience working with LLMs or generative AI systems

Direct experience with AI red teaming, model safety, or adversarial evaluation

Direct experience with prompt injection, jailbreaks, and LLM failure modes

Strong background in experimental design, evaluation, and statistical analysis

Experience analyzing complex model behavior and failure cases beyond standard metrics

Proficiency in Python and common DS/ML tooling

Preferred

Experience evaluating agentic systems, including tool use, memory, or multi-step workflows

Knowledge of GenAI architectures (transformers, embeddings, RAG, agent frameworks)

Experience building custom evaluation datasets or simulation environments

Background or strong interest in security, privacy, or trust & safety

Familiarity with AI evaluation tools (e.g., custom judges, LLM-as-judge, simulation frameworks)

Benefits

You will be eligible to participate in Uber's bonus program

May be offered an equity award & other types of comp

You will also be eligible for various benefits

Company

Uber

Glassdoor3.9

Uber develops, markets, and operates a ride-sharing mobile application that allows consumers to submit a trip request.

Founded in 2009

San Francisco, California, USA

10001+ employees

http://www.uber.com

H1B Sponsorship

Uber has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)

Distribution of Different Job Fields Receiving Sponsorship

Represents job field similar to this job

Trends of Total Sponsorships

2025 (830)

2024 (796)

2023 (684)

2022 (954)

2021 (750)

2020 (638)

Funding

Current Stage

Public Company

Total Funding

$35.56B

Key Investors

William AckmanPayPalToyota Motor

2025-09-08Post Ipo Debt· $2.25B

2025-05-13Post Ipo Debt· $1B

2025-01-01Post Ipo Equity· $2.3B

Leadership Team

Dara Khosrowshahi

CEO

Prashanth Mahendra -Rajah

Chief Financial Officer

Recent News

Indian Express

Why is the Davos WEF important for Maharashtra CM Fadnavis?

2026-01-25

Hindu Business Line

Karnataka HC lifts bike taxi ban, clears path for resumption of services across state

2026-01-24

Inc42 Media

Relief For Rapido, Uber: Karnataka HC Lifts Bike Taxi Ban

2026-01-24

Company data provided by crunchbase