Senior Hardware Engineer - GPU & AI Infrastructure jobs in United States
cer-icon
Apply on Employer Site
company-logo

Roblox · 3 days ago

Senior Hardware Engineer - GPU & AI Infrastructure

Roblox is a platform where millions come to create and connect in immersive digital experiences. They are seeking a Senior Hardware Engineer to lead the GPU and AI infrastructure, focusing on optimizing hardware for high-performance rendering and machine learning workloads.

3D TechnologyGamingMetaverseOnline GamesSoftwareVideo Games
check
Comp. & Benefits
check
H1B Sponsor Likelynote

Responsibilities

Architect & Prototype: Prototype next-generation GPU-accelerated hardware platforms, ensuring seamless integration between high-density compute nodes, high-speed interconnects (NVLink/PCIe Gen5/6), and system firmware
GPU Optimization: Drive the integration, performance testing, and debugging of GPUs in our fleet, focusing specifically on hardware-level optimizations, driver tuning, and thermal/power management
Validation & Certification: Develop and execute rigorous evaluation and stress-testing strategies for GPU-heavy server platforms to ensure they meet Roblox’s unique demands for real-time rendering and low-latency AI inference
Firmware & Systems: Lead firmware qualification (BIOS/BMC) and troubleshooting, implementing automation systems to manage GPU health, firmware updates
Vendor Collaboration: Provide technical guidance and deep-dive feedback to hardware vendors. Lead critical investigations into component-level failures, triaging issues across the hardware, driver, and kernel layers
Observability: Build and maintain advanced monitoring stacks (Grafana/Prometheus) to track GPU metrics like HBM utilization, thermal throttling events, and PCIe bandwidth saturation

Qualification

GPU architectureAI acceleratorsHigh-performance computeFirmware qualificationPythonGoC++PCIe fabricNVLinkInfiniBandLiquid cooling systemsCommunication skillsProblem solvingCollaborationAdaptability

Required

BA/BS Degree in Electrical Engineering, Computer Engineering, or related field with equivalent practical experience
5+ years of hardware engineering experience with a specific focus on GPU architecture, AI accelerators, or high-performance compute (HPC) systems
In-depth understanding of modern data center technologies, including PCIe fabric, NVLink, InfiniBand, and liquid cooling systems for high-TDP hardware
Hands-on experience testing and validating CPU, Memory (HBM/DDR5), Storage (NVMe), and high-speed networking subsystems in a Linux environment
Proficiency in Python, Go, or C++ for developing hardware validation tools and automation scripts
Expert-level skills in debugging complex server issues remotely, with the ability to analyze kernel logs, hardware registers, and bus-level captures

Preferred

NVIDIA HGX/MGX platforms preferred

Benefits

Equity compensation
Benefits as described on this page

Company

Roblox is an online gaming and entertainment platform that offers a shared digital experience that brings people together through play.

H1B Sponsorship

Roblox has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (240)
2024 (111)
2023 (139)
2022 (153)
2021 (91)
2020 (92)

Funding

Current Stage
Public Company
Total Funding
$874.45M
Key Investors
Andreessen HorowitzAltos Ventures
2024-10-01Post Ipo Equity
2022-04-12Series Unknown· $17.71M
2021-08-11Series Unknown

Leadership Team

leader-logo
David Baszucki
Founder and CEO
linkedin
leader-logo
Anupam Singh
VP Engineering, AI Platform, Infrastructure
linkedin
Company data provided by crunchbase