AI Engineer & Researcher - Inference jobs in United States

xAI · 1 day ago

AI Engineer & Researcher - Inference

xAI is focused on creating AI systems that enhance humanity's understanding of the universe. The AI Engineer & Researcher will be responsible for optimizing model inference latency and throughput, building reliable production serving systems, and accelerating research on scaling test-time compute.

Artificial Intelligence (AI) · Generative AI · Information Technology · Machine Learning
Growth Opportunities
H1B Sponsor Likely

Responsibilities

Optimizing the latency and throughput of model inference
Building reliable production serving systems to serve millions of users
Accelerating research on scaling test-time compute

Qualifications

Python · PyTorch · CUDA · Rust · Kubernetes · SGLang · Model optimization · Prioritization skills · Communication · Work ethic

Required

Strong communication skills
Experience with Python / Rust
Experience with PyTorch / JAX
Experience with CUDA / CUTLASS / Triton / NCCL
Experience with Kubernetes
Experience with system optimizations for model serving, such as batching, caching, load balancing, and model parallelism
Experience with low-level optimizations for inference, such as GPU kernels and code generation
Experience with algorithmic optimizations for inference, such as quantization, distillation, and speculative decoding
Experience with large-scale, high-concurrency production serving
Experience with testing, benchmarking, and reliability of inference services

Company

xAI

xAI is an artificial intelligence startup that develops AI solutions and tools to enhance reasoning and search capabilities.

H1B Sponsorship

xAI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for your reference. (Data powered by the US Department of Labor)
[Chart: Distribution of Different Job Fields Receiving Sponsorship; the highlighted segment represents job fields similar to this job]
[Chart: Trends of Total Sponsorships; 2025 (1)]

Funding

Current Stage: Growth Stage
Total Funding: $22.73B
Key Investors: Neptune Digital Assets, SpaceX, Morgan Stanley
2025-12-11 · Secondary Market · $0.3M
2025-07-13 · Corporate Round · $5.32B
2025-07-01 · Debt Financing · $5B

Leadership Team

Toby Pohlen
Founding Member
Company data provided by Crunchbase