AMD · 15 hours ago
Product Application Engineer - Data Center Deployment
AMD is a company focused on building innovative products to enhance computing experiences across various domains. The Product Application Engineer will serve as a key technical resource for data center cluster projects, providing guidance and support to customers while collaborating with internal teams to ensure successful deployments and operations.
AI Infrastructure · Artificial Intelligence (AI) · Cloud Computing · Computer · Embedded Systems · GPU · Hardware · Semiconductor
Responsibilities
Develop a strong understanding of the client’s business to help ensure effective bringup and validation of CSP and customer clusters
Provide technical guidance and support at an advisory level to customers for server clusters, focused on large-scale GPU deployments
Build out datacenter GPU cluster environments for customer testing and deployment
Assist development teams in identifying and resolving hardware/software technical issues throughout the cluster lifecycle, from initial bring-up to entering service for running workloads
Provide technical guidance to internal teams based on customer feedback
Qualify and assess new cluster automation software functionality to ensure compatibility with customer requirements and datacenters
Resolve technical issues for customers utilizing AMD Instinct™ server products in clusters
Mentor junior members of the technical staff
Follow procedures to communicate, report, and escalate incidents to AMD management
Collaborate with program managers to maintain project schedules, track action items, ensure deliverables are met, and provide project status updates to customers and AMD management
Qualifications
Required
Exceptional skills in AI GPU hardware, software, systems management, and networking, especially with high-speed data fabrics
Strong communication skills, with the ability to tactfully interface with both technical and program management resources at CSP and customer sites
Highly analytical, detail-oriented, and self-motivated, with a positive, results-driven attitude
Ability to work closely with customers in an advisory role to provide guidance during large-scale cluster bringup and validation
Experience in large-scale system bringup
Strong customer focus
Self-motivated and capable of working effectively within a team environment
Ability to communicate concisely at all levels within an organization
Bachelor's degree in Computer or Electrical Engineering, Master's preferred
Preferred
Hands-on data center customer support or management roles during cluster system bringup
Advisory PM and technical roles in large-scale data center cluster bringup
Data center customer support tooling skills, using automation tools and frameworks such as Ansible, Bash, Python, and others
Bringup of data center servers and racks, server architecture and functionality, including remote management via BMC, network topologies, and graphics software/hardware subsystems
Linux installation, setup, usage, tuning, and debugging
Virtual environments (e.g., VMware, Citrix, KVM, Microsoft) and virtual machine setup/management
Familiarity with datacenter GPU software stacks such as AMD ROCm™ or NVIDIA CUDA
Familiarity with distributed network libraries (e.g., NCCL/RCCL, MPI) with GPU accelerators in distributed memory systems and high-speed network protocols/topologies
Strong skills in high-performance fabrics for HPC and AI, such as RDMA/RoCE and InfiniBand
Some familiarity with AI and machine learning workloads, frameworks, and models
Strong debugging, problem-solving, and analysis skills
Strong verbal and written communication skills for conveying technical information
Self-starter with attention to detail, organizational skills, and the ability to multitask in a fast-paced environment
May require up to 20% travel
Benefits
AMD benefits at a glance.
Company
AMD
Advanced Micro Devices is a semiconductor company that designs and develops graphics units, processors, and media solutions.
Funding
Current Stage: Public Company
Total Funding: unknown
Key Investors: OpenAI, Daniel Loeb
2025-10-06: Post-IPO Equity
2023-03-02: Post-IPO Equity
2021-06-29: Post-IPO Equity
Company data provided by crunchbase