Advanced Microdevices Pvt. Ltd. (India) · 2 days ago
Software Development Engineer – SGLang
Advanced Micro Devices, Inc is dedicated to building innovative products that enhance computing experiences across various domains. The Software Development Engineer will focus on optimizing and developing deep learning frameworks for AMD GPUs, collaborating with various teams to improve performance and contribute to the AI software ecosystem.
BiopharmaBiotechnologyIndustrialManufacturing
Responsibilities
Optimize Deep Learning Frameworks: Enhance performance of frameworks like TensorFlow, PyTorch, and SGLang on AMD GPUs via upstream contributions in open-source repositories
Develop and Optimize Deep Learning Models: Profile and tune large-scale training and inference models for optimal performance on AMD hardware
GPU Kernel Development: Design, implement, and optimize high-performance GPU kernels using HIP, Triton, or other relevant tools for AI operator efficiency
Collaborate with GPU Library and Compiler Teams: Work closely with internal compiler and GPU math library teams to integrate and align kernel-level optimizations with full-stack performance goals
Contribute to SGLang Development: Support optimization, feature development, and scaling of the SGLang LLM framework across AMD GPU platforms
Distributed System Optimization: Tune and scale performance across both multi-GPU (scale-up) and multi-node (scale-out) environments, including inference parallelism and collective communication strategies
Graph Compiler Integration: Integrate and optimize runtime execution through graph compilers such as XLA, TorchDynamo, or custom pipelines
Open-Source Collaboration: Partner with external maintainers to understand framework needs, propose optimizations, and upstream contributions effectively
Apply Engineering Best Practices: Leverage modern software engineering practices in debugging, profiling, test-driven development, and CI integration
Qualification
Required
Skilled engineer with strong technical and analytical expertise in C++ development within Linux environments
Ability to thrive in both collaborative team settings and independent work
Ability to define goals, manage development efforts, and deliver high-quality solutions
Strong problem-solving skills
Proactive approach
Keen understanding of software engineering best practices
Optimize Deep Learning Frameworks: Enhance performance of frameworks like TensorFlow, PyTorch, and SGLang on AMD GPUs via upstream contributions in open-source repositories
Develop and Optimize Deep Learning Models: Profile and tune large-scale training and inference models for optimal performance on AMD hardware
GPU Kernel Development: Design, implement, and optimize high-performance GPU kernels using HIP, Triton, or other relevant tools for AI operator efficiency
Collaborate with GPU Library and Compiler Teams: Work closely with internal compiler and GPU math library teams to integrate and align kernel-level optimizations with full-stack performance goals
Contribute to SGLang Development: Support optimization, feature development, and scaling of the SGLang LLM framework across AMD GPU platforms
Distributed System Optimization: Tune and scale performance across both multi-GPU (scale-up) and multi-node (scale-out) environments, including inference parallelism and collective communication strategies
Graph Compiler Integration: Integrate and optimize runtime execution through graph compilers such as XLA, TorchDynamo, or custom pipelines
Open-Source Collaboration: Partner with external maintainers to understand framework needs, propose optimizations, and upstream contributions effectively
Apply Engineering Best Practices: Leverage modern software engineering practices in debugging, profiling, test-driven development, and CI integration
Bachelor's and/or Master's Degree in Computer Science, Computer Engineering, Electrical Engineering, or a related field
Preferred
Strong Programming Skills: Proficient in C++ and/or Python, with demonstrated ability to debug, profile, and optimize performance-critical code
SGLang and LLM Optimization: Hands-on experience with SGLang or similar LLM inference frameworks is highly preferred
Compiler and GPU Architecture Knowledge: Background in compiler design or familiarity with technologies like LLVM, MLIR, or ROCm is a plus
Heterogeneous System Workloads: Experience running and scaling workloads on large-scale, heterogeneous clusters (CPU + GPU) using distributed training or inference strategies
AI Framework Integration: Experience contributing to or integrating optimizations into deep learning frameworks such as PyTorch or TensorFlow
GPU Computing: Working knowledge of HIP, CUDA, or other GPU programming models; experience with GCN/CDNA architecture preferred
Benefits
AMD benefits at a glance.
Company
Advanced Microdevices Pvt. Ltd. (India)
Advanced Microdevices (mdi) is a leader in innovative membrane technologies.
Funding
Current Stage
Late StageLeadership Team
Nalini Kant Gupta
Founder & Managing Director
Recent News
2024-10-18
2024-10-16
Company data provided by crunchbase