Senior Staff Software Development Engineer- GPU, LLM, AI jobs in United States
cer-icon
Apply on Employer Site
company-logo

AMD ยท 2 hours ago

Senior Staff Software Development Engineer- GPU, LLM, AI

AMD is a company dedicated to building innovative products that accelerate next-generation computing experiences. They are seeking a Senior Staff Software Development Engineer to improve the performance of key applications and benchmarks, focusing on AI and GPU technologies.

AI InfrastructureArtificial Intelligence (AI)Cloud ComputingComputerEmbedded SystemsGPUHardwareSemiconductor
check
Growth Opportunities
check
H1B Sponsor Likelynote
Hiring Manager
Tressa Cooper (she/her)
linkedin

Responsibilities

Architect and Drive the AI Software Stack: You will establish best practices and optimize performance from the lowest-level GPU kernels to large-scale distributed systems, shaping the foundational software for AMD hardware. By leveraging cutting-edge Large Language Models (LLMs) and agent-based technologies, you will accelerate the development and performance enhancement of the AMD ROCm ecosystem, ensuring it remains at the forefront of AI innovation
Accelerate Foundational Models: Your work will directly accelerate cutting-edge applications like foundation models (LLMs) and autonomous AI agents, ensuring AMD is the platform of choice for the most demanding workloads
Innovate Across Hardware and Software: You will contribute to the entire co-design lifecycle, from influencing future GPU architectures to developing groundbreaking software for new accelerators and collaborating with the broader AI community

Qualification

C++GPU programmingLarge Language ModelsAI systemsKernel optimizationGPU architecturePerformance analysis toolsTechnical ownershipCommunication skillsProblem-solving skills

Required

Exceptional technical expertise in high-performance C++ software engineering and low-level GPU programming
Robust understanding of Large Language Models (LLMs) and AI systems
Ability to bridge kernel engineering with AI post-training (RL) experience
Demonstrating mastery in designing complex, scalable systems using modern C++
Fundamental grasp of GPU architectures (HIP/CUDA), memory hierarchies, and kernel optimization to maximize hardware performance
Significant hands-on experience in large-scale C++/HIP/CUDA projects, such as contributing to the ROCm ecosystem (e.g., rocBLAS, hipDNN, Composable Kernel, AITemplate), CUDA libraries (e.g., cuBLAS, cuDNN, CUTLASS, Thrust, CUB, NCCL), or the C++/HIP/CUDA core of ML frameworks like PyTorch, TensorFlow, or JAX
Deep understanding of LLMs, including but not limited to transformer architectures, attention mechanisms, and the full model lifecycle
Hands-on experience in advanced model alignment and post-training techniques like Supervised Fine-Tuning (SFT) and Reinforcement Learning (e.g., RLHF, GRPO)
Familiarity with cutting-edge trends such as Mixture-of-Experts (MoE) architectures, inference optimizations (e.g., quantization, speculative decoding), and modern application patterns like Agentic AI systems (e.g. AlphaEvolve for code/kernel generation)
Bachelor's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent

Preferred

Lengthy professional software development experience in performance-critical environments
Extensive hands-on experience in GPU programming (HIP/CUDA) and optimizing deep learning kernels and operators
A fundamental understanding of GPU architecture and memory hierarchy, used to diagnose and resolve complex performance bottlenecks
Expert-level proficiency in modern C++ and object-oriented design
Deep experience using GPU profiling and performance analysis tools (e.g., AMD ROCm Profiler, NVIDIA Nsight) to diagnose and resolve complex bottlenecks in distributed, multi-GPU systems
Deep knowledge of transformer architectures, attention mechanisms, and modern AI systems (Generative AI, Agentic AI)
Hands-on experience optimizing the post-training and inference pipelines of Large Language Models (LLMs)
Strong technical ownership, communication, and problem-solving skills with a track record of delivering complex technical solutions
Plus: Experience or deep expertise with the AMD ROCm/HIP ecosystem
Master's degree preferred, PhD is a plus
Relevant publications in AI/ML, GPU computing, or system optimization are highly valued

Benefits

AMD benefits at a glance.

Company

Advanced Micro Devices is a semiconductor company that designs and develops graphics units, processors, and media solutions.

H1B Sponsorship

AMD has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (836)
2024 (770)
2023 (551)
2022 (739)
2021 (519)
2020 (547)

Funding

Current Stage
Public Company
Total Funding
unknown
Key Investors
OpenAIDaniel Loeb
2025-10-06Post Ipo Equity
2023-03-02Post Ipo Equity
2021-06-29Post Ipo Equity

Leadership Team

leader-logo
Lisa Su
Chair & CEO
linkedin
leader-logo
Mark Papermaster
CTO and EVP
linkedin
Company data provided by crunchbase