Member of Technical Staff - Enterprise Model Evaluation jobs in United States
cer-icon
Apply on Employer Site
company-logo

xAI · 1 week ago

Member of Technical Staff - Enterprise Model Evaluation

xAI is dedicated to creating AI systems that enhance humanity's understanding of the universe. The Member of Technical Staff - Enterprise Model Evaluation role involves designing and implementing model evaluations to improve model capabilities and ensure high standards before deployment.

Artificial Intelligence (AI)Foundational AIGenerative AIInformation TechnologyMachine Learning
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Design and implement next-generation evaluation suites beyond traditional benchmarks, creating frameworks that capture real-world utility and performance of Grok in production environments
Coordinate model evaluation efforts and collaborations to ensure comprehensive coverage and fast iterations
Integrate Grok into production systems, gain deep insights into real-world environments, and ensure alignment with user needs and business objectives
Partner with research teams to translate cutting-edge techniques and Grok models into production-ready implementations, optimizing for performance and impact

Qualification

Evaluation frameworksMachine learning modelsStatistical analysisExperimental designCommunication

Required

Proven expertise in designing and implementing sophisticated evaluation frameworks for machine learning models, especially LLMs
Experience with statistical analysis, experimental design, and benchmarking AI systems in real-world settings

Benefits

Equity
Comprehensive medical, vision, and dental coverage
Access to a 401(k) retirement plan
Short & long-term disability insurance
Life insurance
Various other discounts and perks

Company

xAI

twittertwittertwitter
company-logo
XAI is an artificial intelligence startup that develops AI solutions and tools to enhance reasoning and search capabilities.

H1B Sponsorship

xAI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)

Funding

Current Stage
Late Stage
Total Funding
$42.73B
Key Investors
Neptune Digital AssetsSpaceXMorgan Stanley
2026-01-06Series E· $20B
2025-12-11Secondary Market· $0.3M
2025-07-13Corporate Round· $5.32B

Leadership Team

leader-logo
Greg Yang
Co-Founder
linkedin
leader-logo
Yuhuai Wu
Co-Founder
linkedin
Company data provided by crunchbase