Research Scientist / Engineer – Training Infrastructure jobs in United States
cer-icon
Apply on Employer Site
company-logo

Luma AI · 1 hour ago

Research Scientist / Engineer – Training Infrastructure

Luma AI is dedicated to building multimodal AI to enhance human imagination and capabilities. The role focuses on developing and maintaining distributed systems for training large-scale multimodal models using thousands of GPUs, enabling researchers to innovate effectively.

Artificial Intelligence (AI)Generative AIVideoVideo Editing
check
H1B Sponsor Likelynote

Responsibilities

Design, implement, and optimize efficient distributed training systems for models with thousands of GPUs
Research and implement advanced parallelization techniques (FSDP, Tensor Parallel, Pipeline Parallel, Expert Parallel)
Build monitoring, visualization, and debugging tools for large-scale training runs
Optimize training stability, convergence, and resource utilization across massive clusters

Qualification

Distributed PyTorch trainingGPU clustersDistributed systems optimizationCUDACommunication librariesLinux systems administrationContainerizationCloud infrastructure

Required

significant experience solving hard problems in PyTorch
significant experience solving hard problems in CUDA
significant experience solving hard problems in distributed systems
design, implement, and optimize efficient distributed training systems for models with thousands of GPUs
research and implement advanced parallelization techniques (FSDP, Tensor Parallel, Pipeline Parallel, Expert Parallel)
build monitoring, visualization, and debugging tools for large-scale training runs
optimize training stability, convergence, and resource utilization across massive clusters
extensive experience with distributed PyTorch training and parallelisms in foundation model training
deep understanding of GPU clusters, networking, and storage systems
familiarity with communication libraries (NCCL, MPI) and distributed system optimization

Preferred

strong Linux systems administration and scripting capabilities
experience managing training runs across >100 GPUs
experience with containerization, orchestration, and cloud infrastructure

Company

Luma AI

twittertwittertwitter
company-logo
Luma AI develops tools that let users generate photorealistic images and videos from text, image, or video prompts.

H1B Sponsorship

Luma AI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (10)
2024 (3)

Funding

Current Stage
Growth Stage
Total Funding
$1.06B
Key Investors
HUMAINAndreessen HorowitzAmplify Partners
2025-11-19Series C· $900M
2024-12-06Series B· $90M
2024-01-09Series B· $43M

Leadership Team

leader-logo
Amit Jain
Co-Founder
linkedin
Company data provided by crunchbase