xAI · 1 day ago
AI Engineer & Researcher - Inference
xAI is focused on creating AI systems that enhance humanity's understanding of the universe. The AI Engineer & Researcher will be responsible for optimizing model inference latency and throughput, building reliable production serving systems, and accelerating research on scaling test-time compute.
Artificial Intelligence (AI)Generative AIInformation TechnologyMachine Learning
Responsibilities
Optimizing the latency and throughput of model inference
Building reliable production serving systems to serve millions of users
Accelerating research on scaling test-time compute
Qualification
Required
Strong communication skills
Experience with Python / Rust
Experience with PyTorch / JAX
Experience with CUDA / CUTLASS / Triton / NCCL
Experience with Kubernetes
Experience with system optimizations for model serving, such as batching, caching, load balancing, and model parallelism
Experience with low-level optimizations for inference, such as GPU kernels and code generation
Experience with algorithmic optimizations for inference, such as quantization, distillation, and speculative decoding
Experience with large-scale, high concurrent production serving
Experience with testing, benchmarking, and reliability of inference services
Company
xAI
XAI is an artificial intelligence startup that develops AI solutions and tools to enhance reasoning and search capabilities.
H1B Sponsorship
xAI has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)
Funding
Current Stage
Growth StageTotal Funding
$22.73BKey Investors
Neptune Digital AssetsSpaceXMorgan Stanley
2025-12-11Secondary Market· $0.3M
2025-07-13Corporate Round· $5.32B
2025-07-01Debt Financing· $5B
Recent News
Company data provided by crunchbase