xAI · 4 hours ago
Member of Technical Staff, Pre-training Data
xAI is dedicated to creating AI systems that can understand the universe and assist humanity in gaining knowledge. They are seeking a visionary engineer for their pre-training data team to develop and innovate data recipes for training multimodal models that analyze text, image, video, and audio.
Artificial Intelligence (AI)Foundational AIGenerative AIInformation TechnologyMachine Learning
Responsibilities
Collaborate with the crawling team to discover and source datasets
Architect pipelines to transform datasets at petabyte scales with efficiency and precision
Develop robust and diverse evaluations for pre-training models
Craft insightful experiments to assess dataset performance
Innovate the recipe for scaling pre-training data to new frontiers
Qualification
Required
Strong communication skills
Hands-on experience in engineering
Ability to collaborate with teams
Experience in architecting pipelines for large datasets
Proficiency in Python
Experience with Spark
Experience with Ray
Ability to develop evaluations for pre-training models
Experience in crafting experiments to assess dataset performance
Preferred
Expertise in ML and large model scaling
Familiarity with scaling laws
Strong ability to design ML experiments
Familiarity with state-of-the-art techniques for curating AI training data for text, image, audio, and video modalities
Strong engineering abilities in Spark, Ray, and other frameworks for large-scale data processing
Benefits
Equity
Comprehensive medical, vision, and dental coverage
Access to a 401(k) retirement plan
Short & long-term disability insurance
Life insurance
Various other discounts and perks
Company
xAI
XAI is an artificial intelligence startup that develops AI solutions and tools to enhance reasoning and search capabilities.
H1B Sponsorship
xAI has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)
Funding
Current Stage
Late StageTotal Funding
$42.73BKey Investors
Neptune Digital AssetsSpaceXMorgan Stanley
2026-02-02Acquired
2026-01-06Series E· $20B
2025-12-11Secondary Market· $0.3M
Recent News
thepeninsulaqatar.com
2026-02-04
thepeninsulaqatar.com
2026-02-04
2026-02-04
Company data provided by crunchbase