Cloud Services Provider Engineer jobs in United States

AMD · 18 hours ago

Cloud Services Provider Engineer

AMD is a company focused on building innovative products that enhance next-generation computing experiences. The role involves managing Cloud Service Provider (CSP) resource allocations, onboarding, and contract-driven capacity operations, ensuring operational continuity and cross-organizational alignment.
AI Infrastructure · Artificial Intelligence (AI) · Cloud Computing · Computer · Embedded Systems · GPU · Hardware · Semiconductor
Growth Opportunities
No H1B

Responsibilities

CSP Allocation Ownership
Serve as the single-threaded owner for all CSP allocations across the Fleet
Maintain and operate the central allocations tracker, ensuring accuracy and alignment with CSP partners and internal teams
Manage prioritization for backstop capacity, migrations, and fast-moving use cases, supporting escalations raised by allocation forums
Maintain full understanding of all active CSP contracts, capacity SLAs, renewals, expiry schedules, and online/offline transitions
Anticipate and track contract-driven node loss (daily/weekly) and support proactive mitigation plans
Partner with Finance and Planning teams to align on capacity modeling impacts and operational continuity
Collaborate closely with Fleet Management, AIG‑SW, TPMs, Program Management, and Datacenter Ops teams
Attend allocation forums, migration meetings, and outage readiness sessions to ensure CSP capacity is represented and downstream impacts are well understood
Identify CSP-related operational risks early and develop mitigation plans
Provide continuity of coverage across evolving CSP-related demands and future scaling
Reduce reliance on ad-hoc staffing from AIG-SW, ensuring Fleet-side autonomy in managing CSP operations

Qualifications

Cloud operations · CSP contracts · Infrastructure scaling · Operational tooling · Analytical skills · Communication skills · Problem-solving skills · Self-starter · Collaboration

Required

Bachelor's or Master's degree in engineering

Preferred

Experience in cloud operations, allocation management, datacenter operations, or an equivalent technical program management role
Strong understanding of cloud capacity models, CSP contracts, and infrastructure scaling considerations
Experience running cross-functional operational processes in a fast-moving environment
Ability to manage ambiguity, handle multi-stakeholder alignment, and respond quickly to capacity or contract-driven escalations
Expertise with operational tooling and tracking systems (Jira, Conductor, allocation dashboards, Snowflake, etc.)
Strong analytical and problem-solving skills and close attention to detail
Self-starter able to drive tasks to completion independently
Prior experience working with AI/ML workloads, GPU fleet operations, or CSP backstop capacity
Familiarity with Kubernetes, SLURM, and non-standard workloads requiring manual oversight
Direct experience partnering with TPMs and cross‑functional engineering teams
Excellent communication skills and ability to present complex operational scenarios clearly to senior leadership

Benefits

AMD benefits at a glance.

Company

Advanced Micro Devices is a semiconductor company that designs and develops graphics units, processors, and media solutions.

Funding

Current Stage
Public Company
Total Funding
unknown
Key Investors
OpenAI, Daniel Loeb
2025-10-06 · Post-IPO Equity
2023-03-02 · Post-IPO Equity
2021-06-29 · Post-IPO Equity

Leadership Team

Lisa Su
Chair & CEO
Mark Papermaster
CTO and EVP
Company data provided by crunchbase