Senior ML/AI DevOps Engineer jobs in United States

AMD · 11 hours ago

Senior ML/AI DevOps Engineer

AMD is a company focused on building great products for next-generation computing experiences, including AI and data centers. They are seeking a Senior ML/AI DevOps Engineer to drive automated infrastructure deployment and CI/CD design for their AI software and datacenter platforms, ensuring reliable and secure infrastructure to support evolving AI initiatives.

AI Infrastructure, Artificial Intelligence (AI), Cloud Computing, Computer, Embedded Systems, GPU, Hardware, Semiconductor
Growth Opportunities
No H1B

Responsibilities

Create resilient automation pipelines, orchestrate Kubernetes-based environments, and ensure seamless integration of diverse software components
Design CI/CD pipelines, automate bare-metal-to-Kubernetes bring-up, deploy microservices with Helm, and integrate security and static code analysis tools
Build monitoring systems and automated alerts, diagnose and resolve complex build failures, and collaborate with teams across the organization to ensure validation readiness for AI solutions
Play a critical role in advancing AMD’s infrastructure, scaling enterprise deployment workflows, refining automation architectures, and enabling rapid iteration across the Software and Solutions organization

Qualifications

DevOps engineering, Infrastructure automation, Kubernetes, Python automation, CI/CD frameworks, Docker, Linux administration, Systems integration, Networking fundamentals, Observability tools, Collaboration skills

Required

Deep technical expertise in infrastructure automation, cloud-native systems, and DevOps engineering
Strong cross-functional collaboration skills
Create resilient automation pipelines
Orchestrate Kubernetes-based environments
Ensure seamless integration of diverse software components
Design CI/CD pipelines
Automate bare-metal-to-Kubernetes bring-up
Deploy microservices with Helm
Integrate security and static code analysis tools
Build monitoring systems and automated alerts
Diagnose and resolve complex build failures
Collaborate with teams across the organization to ensure validation readiness for AI solutions
Play a critical role in advancing AMD's infrastructure
Scale enterprise deployment workflows
Refine automation architectures
Enable rapid iteration across the Software and Solutions organization
BS, MS, or PhD in Computer Science or a related field, plus 6 years of applicable experience

Preferred

Extensive expertise in Python automation, CI/CD frameworks (Jenkins, Terraform, Ansible), and Kubernetes/Helm-based microservice deployment
Strong experience with Docker container development, GitHub Actions, Linux system administration, and networking fundamentals (PXE, IPMI, switching/routing)
A solid understanding of observability tools such as Prometheus, Grafana, or ELK/Kibana
Best practices in secure development and scalable automation

Benefits

AMD benefits at a glance.

Company

Advanced Micro Devices is a semiconductor company that designs and develops graphics units, processors, and media solutions.

Funding

Current Stage
Public Company
Total Funding
unknown
Key Investors
OpenAI, Daniel Loeb
2025-10-06 Post-IPO Equity
2023-03-02 Post-IPO Equity
2021-06-29 Post-IPO Equity

Leadership Team

Lisa Su
Chair & CEO
Mark Papermaster
CTO and EVP
Company data provided by Crunchbase