Advanced Microdevices Pvt. Ltd. (India) · 3 weeks ago
Principal / Senior GPU Software Performance Engineer — Post‑Training
Advanced Micro Devices, Inc is a company focused on building innovative products that enhance computing experiences across various domains. They are seeking a Principal / Senior GPU Software Performance Engineer to drive performance on post-training workloads using AMD GPUs, optimizing training pipelines and collaborating with various teams to achieve measurable improvements.
BiopharmaBiotechnologyIndustrialManufacturing
Responsibilities
Lead performance for finetuning and RL training solutions on AMD GPUs
Improve throughput, memory efficiency, and stability across data, model, and optimizer steps
Optimize multi‑GPU/multi‑node training and communication patterns
Contribute efficient kernels/ops and targeted graph‑level optimizations
Profile, diagnose, and resolve bottlenecks using standard tooling; prevent regressions in CI
Ship reproducible pipelines and documentation adopted by internal teams and external developers
Collaborate with framework, compiler, and model teams to land durable improvements
Qualification
Required
Drive the performance of post-training workloads on AMD Instinct™ GPUs
Work across kernels, distributed training, and framework integrations to deliver fast, stable, and reproducible training pipelines on ROCm
Lead performance for finetuning and RL training solutions on AMD GPUs
Improve throughput, memory efficiency, and stability across data, model, and optimizer steps
Optimize multi-GPU/multi-node training and communication patterns
Contribute efficient kernels/ops and targeted graph-level optimizations
Profile, diagnose, and resolve bottlenecks using standard tooling; prevent regressions in CI
Ship reproducible pipelines and documentation adopted by internal teams and external developers
Collaborate with framework, compiler, and model teams to land durable improvements
B.S./M.S./Ph.D. in Computer Science, Computer Engineering, Electrical Engineering, or equivalent
Preferred
Proven GPU performance engineering for deep learning (ROCm/HIP, Triton, or similar)
Hands-on with SFT, LoRA and RL-based training at scale
Strong PyTorch experience (torch.distributed, FSDP/ZeRO or equivalent)
Proficient in Python and C++; comfortable reading/writing kernels when needed
Experience with distributed systems and collective communication libraries
Track record of turning profiles into fixes, upstreaming changes, and documenting results
Benefits
AMD benefits at a glance.
Company
Advanced Microdevices Pvt. Ltd. (India)
Advanced Microdevices (mdi) is a leader in innovative membrane technologies.
Funding
Current Stage
Late StageLeadership Team
Nalini Kant Gupta
Founder & Managing Director
Recent News
2024-10-18
2024-10-16
Company data provided by crunchbase