CVP FDE – AI Software Development jobs in United States
cer-icon
Apply on Employer Site
company-logo

AMD · 7 hours ago

CVP FDE – AI Software Development

AMD is a company focused on building innovative products that enhance computing experiences across various domains. The role involves leading a world-class FDE organization, optimizing customer deployments, and acting as the voice of the customer to drive engineering roadmaps.

AI InfrastructureArtificial Intelligence (AI)Cloud ComputingComputerEmbedded SystemsGPUHardwareSemiconductor
check
Growth Opportunities
badNo H1Bnote

Responsibilities

Cluster Bring-up & Optimization: Oversee the technical onboarding of massive GPU clusters. Ensure your team can troubleshoot collective communication errors, debug framework issues, and optimize training/inference strategies
Utilization Engineering (The North Star Metric): Drive and maintain industry-leading Customer GPU Utilization across clusters of thousands of GPUs, making cluster satisfaction the key measure of success
High-Performance Model Deployment: Enable customer success by deeply optimizing open-sourced models (Llama 3, DeepSeek, Mixtral) and proprietary models for our specific hardware topology, utilizing tools like vLLM and TensorRT-LLM
Executive Technical Sponsorship: Act as the technical authority and executive sponsor on large deals, possessing the credibility to validate architecture with CTOs and VPs of AI
Feedback Loop: Aggressively channel field intelligence back to Product Engineering. If customers are struggling with a specific use case, then become the loudest voice in the room demanding a fix

Qualification

AI FrameworksDistributed ComputingGPU EcosystemLLM OperationsCommercial AcumenTechnical LeadershipHigh-Stakes Crisis ManagementHardware/Software Hybrid

Required

Build and scale a world-class FDE organization, strategically combining ML Generalists, Low-Level Kernel Optimizers, and Solutions Architects to cover the full customer deployment lifecycle
Define and institutionalize the FDE Engagement Model to maximize resource leverage and ensure consistent, high-velocity customer outcomes
Serve as the Voice of the Customer internally: Translate field intelligence and customer challenges into concrete, prioritized engineering roadmaps, and ensure execution
Cluster Bring-up & Optimization: Oversee the technical onboarding of massive GPU clusters. Ensure your team can troubleshoot collective communication errors, debug framework issues, and optimize training/inference strategies
Utilization Engineering (The North Star Metric): Drive and maintain industry-leading Customer GPU Utilization across clusters of thousands of GPUs, making cluster satisfaction the key measure of success
High-Performance Model Deployment: Enable customer success by deeply optimizing open-sourced models (Llama 3, DeepSeek, Mixtral) and proprietary models for our specific hardware topology, utilizing tools like vLLM and TensorRT-LLM
Executive Technical Sponsorship: Act as the technical authority and executive sponsor on large deals, possessing the credibility to validate architecture with CTOs and VPs of AI
Feedback Loop: Aggressively channel field intelligence back to Product Engineering. If customers are struggling with a specific use case, then become the loudest voice in the room demanding a fix
Technical Competency: AI Frameworks: PyTorch, JAX, TensorFlow
Technical Competency: Distributed Computing: Slurm, Ray, Kubernetes (K8s), Docker
Technical Competency: GPU Ecosystem: NVIDIA drivers, CUDA profiling (Nsight Systems), Triton Inference Server
Technical Competency: LLM Operations (Differentiator): Significant experience with advanced LLM deployment and customization techniques, including fine-tuning (e.g., LoRA/QLoRA) and building RAG pipelines
Academic Credentials: BS, MS or equivalent with direct experience

Preferred

Years of Technical Leadership: Demonstrated track record leading high-impact technical teams within high-stakes environments (e.g., Cloud Infrastructure, AI Platform, or HPC)
The 'Hardware/Software' Hybrid: You understand the stack from the metal up
Commercial Acumen & Fluency: Deep understanding of commercial drivers (ARR, Churn, Margin) and the ability to articulate how technical solutions impact deal velocity and business outcomes
High-Stakes Crisis Management: Experience leading through 'Sev0' customer incidents (e.g., massive training run failures), demonstrating the poise and clarity required to manage executive communication while guiding rapid root cause resolution

Benefits

AMD benefits at a glance.

Company

Advanced Micro Devices is a semiconductor company that designs and develops graphics units, processors, and media solutions.

Funding

Current Stage
Public Company
Total Funding
unknown
Key Investors
OpenAIDaniel Loeb
2025-10-06Post Ipo Equity
2023-03-02Post Ipo Equity
2021-06-29Post Ipo Equity

Leadership Team

leader-logo
Lisa Su
Chair & CEO
linkedin
leader-logo
Mark Papermaster
CTO and EVP
linkedin
Company data provided by crunchbase