Member of Technical Staff, Model Evaluation jobs in United States
cer-icon
Apply on Employer Site
company-logo

xAI · 1 week ago

Member of Technical Staff, Model Evaluation

xAI is dedicated to creating AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. The role involves assessing models, identifying weaknesses in model training, and collaborating with teams to enhance model quality.

Artificial Intelligence (AI)Foundational AIGenerative AIInformation TechnologyMachine Learning
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Provide complete assessment of models
Deep dive into model training and data to identify the weakness point revealed in evaluation
Communicate with modeling and data team to come up with plans to improve model quality

Qualification

PythonModel assessmentJAXXLARust / C++SparkPrioritization skillsCommunication skillsWork ethic

Required

Strong communication skills
Hands-on contribution to the company's mission
Ability to prioritize tasks effectively
Experience with Python
Experience with JAX and XLA
Experience with Rust / C++
Experience with Spark
Experience in model assessment and evaluation task development (including public and in-house benchmarking)
Ability to collect data and synthesize data for new evaluations
Experience building infrastructure and framework for easy-to-use model evaluation
Familiarity with inference frameworks like SGlang and vLLM

Preferred

Located near the Bay Area or open to relocation

Benefits

Equity
Comprehensive medical, vision, and dental coverage
Access to a 401(k) retirement plan
Short & long-term disability insurance
Life insurance
Various other discounts and perks

Company

xAI

twittertwittertwitter
company-logo
XAI is an artificial intelligence startup that develops AI solutions and tools to enhance reasoning and search capabilities.

H1B Sponsorship

xAI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)

Funding

Current Stage
Late Stage
Total Funding
$42.73B
Key Investors
Neptune Digital AssetsSpaceXMorgan Stanley
2026-01-06Series E· $20B
2025-12-11Secondary Market· $0.3M
2025-07-13Corporate Round· $5.32B

Leadership Team

leader-logo
Toby Pohlen
Founding Member
linkedin
Company data provided by crunchbase