AI Engineer & Researcher - GPU Kernel jobs in United States
cer-icon
Apply on Employer Site
company-logo

xAI · 1 day ago

AI Engineer & Researcher - GPU Kernel

xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. The AI Engineer & Researcher will focus on developing and optimizing CUDA kernels for GPU operations, contributing to the advancement of deep learning technologies.

Artificial Intelligence (AI)Foundational AIGenerative AIInformation TechnologyMachine Learning
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Developing and improving low-level CUDA kernel optimizations for state-of-the-art inference and training software stack
Profiling, debugging, and optimizing single and multi-GPU operations using tools such as Nsight
Understanding GPU memory hierarchy and computation capabilities
Implementing the latest methods from the deep learning literature in low-level CUDA kernels
Innovating new ideas that bring us closer to the limits of a GPU

Qualification

CUDAC/C++PythonNsightGeMM CUDA kernelsPybindPrioritization skillsCommunication skillsWork ethic

Required

Strong communication skills to concisely and accurately share knowledge with teammates
Experience with CUDA
Experience with CUTLASS
Proficiency in C/C++ and Python binding tools
Experience in developing and improving low-level CUDA kernel optimizations for state-of-the-art inference and training software stack
Experience in profiling, debugging, and optimizing single and multi-GPU operations using tools such as Nsight
Understanding of GPU memory hierarchy and computation capabilities
Ability to implement the latest methods from the deep learning literature in low-level CUDA kernels
Innovative thinking to bring new ideas that push the limits of a GPU
Experience building high-performance GeMM CUDA kernels using Tensor cores or CUDA cores from scratch or by utilizing CuTe/CUTLASS
Experience implementing features for attention kernel by extending existing kernels or writing them from scratch
Comfortable with writing both forward and backward kernels and ensuring correctness while considering floating point errors
Experience optimizing for both memory-bound and compute-bound operations
Ability to reason about register pressure, shared-memory usage, and GPU utilization through tools such as Nsight and removing bottlenecks
Familiarity with the latest and most effective techniques in optimizing inference and training workloads
Experience using pybind to integrate custom-written kernels into a framework, especially JAX/XLA

Company

xAI

twittertwittertwitter
company-logo
XAI is an artificial intelligence startup that develops AI solutions and tools to enhance reasoning and search capabilities.

H1B Sponsorship

xAI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)

Funding

Current Stage
Late Stage
Total Funding
$42.73B
Key Investors
Neptune Digital AssetsSpaceXMorgan Stanley
2026-01-06Series E· $20B
2025-12-11Secondary Market· $0.3M
2025-07-13Corporate Round· $5.32B

Leadership Team

leader-logo
Greg Yang
Co-Founder
linkedin
leader-logo
Yuhuai Wu
Co-Founder
linkedin
Company data provided by crunchbase