AI Software Development Eng. jobs in United States
cer-icon
Apply on Employer Site
company-logo

Advanced Microdevices Pvt. Ltd. (India) · 9 hours ago

AI Software Development Eng.

Advanced Micro Devices, Inc is a company focused on building innovative products that enhance computing experiences across various domains including AI and data centers. They are seeking a passionate AI Software Development Engineer to work on Distributed Inferencing on AMD GPUs, optimizing performance and scalability of AI workloads while collaborating with a team of specialists.

BiopharmaBiotechnologyIndustrialManufacturing

Responsibilities

Enable and benchmark AI models on large-scale distributed systems to evaluate performance, accuracy, and scalability
Optimize AI workloads across scale-up (multi-GPU), scale-out (multi-node), and scale-across distributed system configurations
Collaborate closely with internal GPU library teams to analyze and optimize distributed workloads for high throughput and low latency
Develop and apply optimal parallelization strategies for AI workloads to achieve best-in-class performance across diverse system configurations
Contribute to distributed model management systems, model zoos, monitoring frameworks, benchmarking pipelines, and technical documentation
Build and maintain real-time dashboards reporting performance, accuracy, and reliability metrics for internal stakeholders and external users

Qualification

C++PythonDistributed SystemsAI FrameworksCluster ManagementCI/CD ToolsQuality AssuranceProblem-SolvingCollaboration

Required

Strong technical expertise in C++/ Python development
Solving performance and investigating scalability on multi-GPU, multi-node clusters
Passionate about quality assurance, benchmarking, and automation in the AI/ML space
Thrives in both collaborative and independent environments
Demonstrates excellent problem-solving skills
Takes ownership in defining goals and delivering impactful solutions
Master's or PhD degree in Computer Science, Computer Engineering, or a related field, or equivalent practical experience

Preferred

AI Framework Engineering: Hands-on experience with AI inference or serving frameworks such as vLLM, SGLang, and Llama.cpp
KV Cache and Expert Parallelization: Understanding KV cache transfer mechanisms and technologies (e.g., Mooncake, NIXL/RIXL) and expert parallelization approaches (e.g., DeepEP, MORI, PPLX-Garden)
Programming and Software Design: Strong C/C++ and Python skills, with experience in software design, debugging, performance analysis, and test development
Large-Scale Distributed Systems: Experience running AI workloads on large-scale, heterogeneous compute clusters
Cluster and Orchestration Systems: Familiarity with cluster management and orchestration platforms such as SLURM and Kubernetes (K8s)
Development Tools and Workflows: Experience with GitHub, Jenkins, or similar CI/CD tools and modern development workflows

Benefits

AMD benefits at a glance.

Company

Advanced Microdevices Pvt. Ltd. (India)

twittertwittertwitter
company-logo
Advanced Microdevices (mdi) is a leader in innovative membrane technologies.

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Nalini Kant Gupta
Founder & Managing Director
Company data provided by crunchbase