Software Development Engineer - Kernel Development jobs in United States
info-icon
This job has closed.
company-logo

AMD · 1 week ago

Software Development Engineer - Kernel Development

AMD is a company focused on building innovative products that enhance computing experiences across various sectors. The role involves optimizing and developing deep learning frameworks for AMD GPUs, requiring collaboration with internal teams and open-source maintainers to improve performance and integration.

AI InfrastructureArtificial Intelligence (AI)Cloud ComputingComputerEmbedded SystemsGPUHardwareSemiconductor
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Optimize Deep Learning Frameworks: Enhance and optimize frameworks like PyTorch / vllm / SGLang for AMD GPUs in open-source repositories
Design and scale: multi‑GPU inference strategies (TP/PP/EP hybrid). Develop & Optimize Models: Design and optimize deep learning models specifically for AMD GPU performance
Collaborate with GPU Library Teams: Work closely with internal teams to analyze and improve training and inference performance on AMD GPUs
Collaborate with Open-Source Maintainers: Engage with framework maintainers to ensure code changes are aligned with requirements and integrated upstream
Work in Distributed Computing Environments: Optimize deep learning performance on both scale-up (multi-GPU) and scale-out (multi-node) systems
Utilize Cutting-Edge Compiler Tech: Leverage advanced compiler technologies to improve deep learning performance
Optimize Deep Learning Pipeline: Enhance the full pipeline, including integrating graph compilers
Software Engineering Best Practices: Apply sound engineering principles to ensure robust, maintainable solutions

Qualification

PythonGPU kernel developmentDeep learning frameworksC++Distributed computingCompiler optimizationPerformance tuningAnalytical skillsProblem-solvingCollaboration

Required

Skilled engineer with strong technical and analytical expertise in Python development within Linux environments
Ability to thrive in both collaborative team settings and independent work
Ability to define goals, manage development efforts, and deliver high-quality solutions
Strong problem-solving skills
Proactive approach
Keen understanding of software engineering best practices
Optimize Deep Learning Frameworks: Enhance and optimize frameworks like PyTorch / vllm / SGLang for AMD GPUs in open-source repositories
Design and scale: multi‑GPU inference strategies (TP/PP/EP hybrid)
Develop & Optimize Models: Design and optimize deep learning models specifically for AMD GPU performance
Collaborate with GPU Library Teams: Work closely with internal teams to analyze and improve training and inference performance on AMD GPUs
Collaborate with Open-Source Maintainers: Engage with framework maintainers to ensure code changes are aligned with requirements and integrated upstream
Work in Distributed Computing Environments: Optimize deep learning performance on both scale-up (multi-GPU) and scale-out (multi-node) systems
Utilize Cutting-Edge Compiler Tech: Leverage advanced compiler technologies to improve deep learning performance
Optimize Deep Learning Pipeline: Enhance the full pipeline, including integrating graph compilers
Software Engineering Best Practices: Apply sound engineering principles to ensure robust, maintainable solutions
Master's or PhD in Computer Science, Computer Engineering, Electrical Engineering, or a related field

Preferred

Strong background in GPU kernel development and LLM inference frameworks
Hands-on understanding of SGLang internals or similar stacks such as vLLM and FasterTransformer
Solid experienced with distributed inference scaling and proven contributor to upstream open-source projects
Strong and significant experience in integrating optimized GPU performance into machine learning frameworks (e.g., TensorFlow, PyTorch) to accelerate model training and inference, with a focus on scaling and throughput
Expert in Python and C++, with experience in debugging, performance tuning, and test design to ensure high-quality, maintainable software solutions
Solid experienced in running large-scale workloads on heterogeneous compute clusters, optimizing for efficiency and scalability
Foundational understanding of compiler theory and tools like LLVM and ROCm for kernel and system performance optimization

Benefits

AMD benefits at a glance.

Company

Advanced Micro Devices is a semiconductor company that designs and develops graphics units, processors, and media solutions.

H1B Sponsorship

AMD has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (836)
2024 (770)
2023 (551)
2022 (739)
2021 (519)
2020 (547)

Funding

Current Stage
Public Company
Total Funding
unknown
Key Investors
OpenAIDaniel Loeb
2025-10-06Post Ipo Equity
2023-03-02Post Ipo Equity
2021-06-29Post Ipo Equity

Leadership Team

leader-logo
Lisa Su
Chair & CEO
linkedin
leader-logo
Mark Papermaster
CTO and EVP
linkedin
Company data provided by crunchbase