xAI · 2 weeks ago
RDMA Engineer - Supercomputing
xAI is on a mission to create AI systems that enhance humanity's understanding of the universe. They are seeking an RDMA Engineer to design and optimize networking solutions for GPU supercomputing clusters, focusing on low-latency and high-bandwidth communication systems.
Artificial Intelligence (AI)Information TechnologyFoundational AIGenerative AIMachine Learning
Responsibilities
Develop and tune RDMA-based communication systems leveraging NVIDIA GPUs and Mellanox NICs (InfiniBand, RoCE) for ultra-fast data transfer between nodes
Implement and optimize GPUDirect RDMA to enable direct memory access between GPUs and network interfaces, minimizing CPU overhead
Integrate RDMA solutions with Kubernetes-based workloads, ensuring seamless operation across distributed compute and storage systems
Collaborate with AI researchers and infrastructure teams to accelerate data pipelines and collective communications using NCCL and MPI
Troubleshoot and resolve performance bottlenecks in high-throughput, low-latency networking environments
Qualification
Required
Hands-on experience with NVIDIA RDMA technologies (e.g., GPUDirect RDMA, RoCE, InfiniBand) in HPC or AI supercomputing environments
Proficiency in programming with Rust, C, or C++ for low-level networking and system optimization
Familiarity with NVIDIA's networking stack, including Mellanox drivers, libraries (e.g., libibverbs), and tools (e.g., NVPeerMemory)
Experience optimizing distributed systems with MPI, NCCL, or similar frameworks for GPU-accelerated workloads
Knowledge of Kubernetes networking and integrating RDMA into containerized environments
Preferred
Background in AI/ML training workflows and their networking demands (e.g., large-scale parameter synchronization)
Company
xAI
XAI is an artificial intelligence startup that develops AI solutions and tools to enhance reasoning and search capabilities.
H1B Sponsorship
xAI has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)
Funding
Current Stage
Late StageTotal Funding
$42.73BKey Investors
Neptune Digital AssetsSpaceXMorgan Stanley
2026-02-02Acquired
2026-01-06Series E· $20B
2025-12-11Secondary Market· $0.3M
Recent News
2026-02-09
Business Standard India
2026-02-09
Company data provided by crunchbase