Principal / Senior GPU Software Performance Engineer — Post‑Training jobs in United States
cer-icon
Apply on Employer Site
company-logo

Advanced Microdevices Pvt. Ltd. (India) · 3 weeks ago

Principal / Senior GPU Software Performance Engineer — Post‑Training

Advanced Micro Devices, Inc is a company focused on building innovative products that enhance computing experiences across various domains. They are seeking a Principal / Senior GPU Software Performance Engineer to drive performance on post-training workloads using AMD GPUs, optimizing training pipelines and collaborating with various teams to achieve measurable improvements.

BiopharmaBiotechnologyIndustrialManufacturing

Responsibilities

Lead performance for finetuning and RL training solutions on AMD GPUs
Improve throughput, memory efficiency, and stability across data, model, and optimizer steps
Optimize multi‑GPU/multi‑node training and communication patterns
Contribute efficient kernels/ops and targeted graph‑level optimizations
Profile, diagnose, and resolve bottlenecks using standard tooling; prevent regressions in CI
Ship reproducible pipelines and documentation adopted by internal teams and external developers
Collaborate with framework, compiler, and model teams to land durable improvements

Qualification

GPU performance engineeringDeep learning frameworksPyTorchPythonC++Distributed systemsCollaborationCommunicationProblem-solving

Required

Drive the performance of post-training workloads on AMD Instinct™ GPUs
Work across kernels, distributed training, and framework integrations to deliver fast, stable, and reproducible training pipelines on ROCm
Lead performance for finetuning and RL training solutions on AMD GPUs
Improve throughput, memory efficiency, and stability across data, model, and optimizer steps
Optimize multi-GPU/multi-node training and communication patterns
Contribute efficient kernels/ops and targeted graph-level optimizations
Profile, diagnose, and resolve bottlenecks using standard tooling; prevent regressions in CI
Ship reproducible pipelines and documentation adopted by internal teams and external developers
Collaborate with framework, compiler, and model teams to land durable improvements
B.S./M.S./Ph.D. in Computer Science, Computer Engineering, Electrical Engineering, or equivalent

Preferred

Proven GPU performance engineering for deep learning (ROCm/HIP, Triton, or similar)
Hands-on with SFT, LoRA and RL-based training at scale
Strong PyTorch experience (torch.distributed, FSDP/ZeRO or equivalent)
Proficient in Python and C++; comfortable reading/writing kernels when needed
Experience with distributed systems and collective communication libraries
Track record of turning profiles into fixes, upstreaming changes, and documenting results

Benefits

AMD benefits at a glance.

Company

Advanced Microdevices Pvt. Ltd. (India)

twittertwittertwitter
company-logo
Advanced Microdevices (mdi) is a leader in innovative membrane technologies.

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Nalini Kant Gupta
Founder & Managing Director
Company data provided by crunchbase