Apply on Employer Site

Apple · 6 hours ago

AI Evaluation Engineer - Health

Cupertino, CA

Full-time

Onsite

Senior Level, Lead/Staff

$181K/yr - $318K/yr

10+ years exp

Apple is a leading technology company focused on enhancing health and well-being through innovative technologies. The AI Evaluation Engineer - Health will develop and validate methodologies for evaluating Generative AI systems in health applications, ensuring the reliability and trustworthiness of AI features through comprehensive evaluation frameworks and statistical analyses.

AppsArtificial Intelligence (AI)BroadcastingDigital EntertainmentFoundational AIMedia and EntertainmentMobile DevicesOperating SystemsTVWearables

Comp. & Benefits

H1B Sponsor Likely

Responsibilities

Design and implement evaluation frameworks for measuring model performance, including human annotation protocols, quality control mechanisms, statistical reliability analysis, and LLM-based autograders to scale evaluation

Apply statistical methods to extract meaningful signals from human-annotated datasets, derive actionable insights, and implement improvements to models and evaluation methodologies

Analyze model behavior, identify weaknesses, and drive design decisions with failure analysis. Examples include, but not limited to: model experimentation, adversarial testing, creating insight/interpretability tools to understand and predict failure modes

Work across the entire ML development cycle, such as developing and managing data from various endpoints, managing ML training jobs with large datasets, and building efficient and scalable model evaluation pipelines

Collaborate with engineers to build reliable end-to-end pipelines for long-term projects

Work cross-functionally to apply algorithms to real-world applications with designers, clinical experts, and engineering teams across Hardware and Software

Independently run and analyze ML experiments for real improvements

Qualification

PythonStatistical analysisLLM developmentData processing pipelinesHuman annotation frameworksCommunication skillsCross-functional collaborationCustomer-focused mindset

Required

BS and a minimum of 10 years relevant industry experience

Proficiency in Python and ability to write clean, performant code and collaborate using standard software development practices

Experience in building data and inference pipelines to process large scale datasets

Strong statistical analysis skills and experience validating data quality and model performance

Experience with applied LLM development, prompt engineering, chain of thought, etc

Preferred

MS or PhD in relevant fields

Experience with LLM-based evaluation systems and synthetic data generation techniques, and evaluating and improving such systems

Experience in rigorous, evidence-based approaches to test development, e.g. quantitative and qualitative test design, reliability and validity analysis

Customer-focused mindset with experience or strong interest in building consumer digital health and wellness products

Strong communication skills and ability to work cross-functionally with technical and non-technical stakeholders

Benefits

Comprehensive medical and dental coverage

Retirement benefits

A range of discounted products and free services

Reimbursement for certain educational expenses — including tuition

Discretionary bonuses or commission payments

Relocation

Company

Apple

Glassdoor4.2

Apple is a technology company that designs, manufactures, and markets consumer electronics, personal computers, and software.

Founded in 1976

Cupertino, California, USA

10001+ employees

https://www.apple.com

H1B Sponsorship

Apple has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)

Distribution of Different Job Fields Receiving Sponsorship

Represents job field similar to this job

Trends of Total Sponsorships

2025 (6998)

2024 (3766)

2023 (3939)

2022 (4822)

2021 (4060)

2020 (3656)

Funding

Current Stage

Public Company

Total Funding

$5.67B

Key Investors

Berkshire HathawayMicrosoftSequoia Capital

2025-05-05Post Ipo Debt· $4.5B

2025-01-16Post Ipo Debt· $0.31M

2021-04-30Post Ipo Equity

Leadership Team

Tim Cook

CEO

Craig Federighi

SVP, Software Engineering

Recent News

Venrock

Venrock Portfolio

2025-12-01

IndiaTimes

Apple TV+ pulls Jessica Chastain’s "The Savant" just days before premiere

2025-09-25

Mac Daily News

Jessica Chastain: I respect Apple’s decision to pause the release of ‘The Savant’ or something

2025-09-25

Company data provided by crunchbase