Principal / Senior GPU Software Performance Engineer — Post‑Training jobs in United States
cer-icon
Apply on Employer Site
company-logo

AMD · 6 days ago

Principal / Senior GPU Software Performance Engineer — Post‑Training

AMD is a company dedicated to building innovative products that drive next-generation computing experiences. The Principal / Senior GPU Software Performance Engineer will focus on optimizing post-training workloads on AMD GPUs, ensuring performance improvements and stability across various training solutions.

AI InfrastructureArtificial Intelligence (AI)Cloud ComputingComputerEmbedded SystemsGPUHardwareSemiconductor
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Lead performance for finetuning and RL training solutions on AMD GPUs
Improve throughput, memory efficiency, and stability across data, model, and optimizer steps
Optimize multi‑GPU/multi‑node training and communication patterns
Contribute efficient kernels/ops and targeted graph‑level optimizations
Profile, diagnose, and resolve bottlenecks using standard tooling; prevent regressions in CI
Ship reproducible pipelines and documentation adopted by internal teams and external developers
Collaborate with framework, compiler, and model teams to land durable improvements

Qualification

GPU performance engineeringDeep learning frameworksPythonC++Distributed systemsProfilingDiagnosticsCollaborationCommunicationProblem-solving

Required

Drive the performance of post-training workloads on AMD Instinct™ GPUs
Work across kernels, distributed training, and framework integrations to deliver fast, stable, and reproducible training pipelines on ROCm
Lead performance for finetuning and RL training solutions on AMD GPUs
Improve throughput, memory efficiency, and stability across data, model, and optimizer steps
Optimize multi-GPU/multi-node training and communication patterns
Contribute efficient kernels/ops and targeted graph-level optimizations
Profile, diagnose, and resolve bottlenecks using standard tooling; prevent regressions in CI
Ship reproducible pipelines and documentation adopted by internal teams and external developers
Collaborate with framework, compiler, and model teams to land durable improvements
B.S./M.S./Ph.D. in Computer Science, Computer Engineering, Electrical Engineering, or equivalent

Preferred

Proven GPU performance engineering for deep learning (ROCm/HIP, Triton, or similar)
Hands-on with SFT, LoRA and RL-based training at scale
Strong PyTorch experience (torch.distributed, FSDP/ZeRO or equivalent)
Proficient in Python and C++; comfortable reading/writing kernels when needed
Experience with distributed systems and collective communication libraries
Track record of turning profiles into fixes, upstreaming changes, and documenting results

Benefits

AMD benefits at a glance.

Company

Advanced Micro Devices is a semiconductor company that designs and develops graphics units, processors, and media solutions.

H1B Sponsorship

AMD has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (836)
2024 (770)
2023 (551)
2022 (739)
2021 (519)
2020 (547)

Funding

Current Stage
Public Company
Total Funding
unknown
Key Investors
OpenAIDaniel Loeb
2025-10-06Post Ipo Equity
2023-03-02Post Ipo Equity
2021-06-29Post Ipo Equity

Leadership Team

leader-logo
Lisa Su
Chair & CEO
linkedin
leader-logo
Mark Papermaster
CTO and EVP
linkedin
Company data provided by crunchbase