Sr Software Engineer, Machine Learning Platform Technologies - Cloud Infrastructure jobs in United States
cer-icon
Apply on Employer Site
company-logo

Apple · 1 day ago

Sr Software Engineer, Machine Learning Platform Technologies - Cloud Infrastructure

Apple is seeking a hands-on technical leader for their MLPT Cloud Infrastructure Team, responsible for designing and scaling cloud-native ML infrastructure. The role involves collaborating across teams to develop automated infrastructure solutions that enhance ML training and inference capabilities at scale.

AppsArtificial Intelligence (AI)BroadcastingConsumer ElectronicsDigital EntertainmentMedia and EntertainmentMobile DevicesOperating SystemsTVWearables
check
Comp. & Benefits
check
H1B Sponsor Likelynote

Responsibilities

Architect and develop cloud-native, agentic infrastructure platforms supporting ML training, inference, and large-scale distributed systems
Lead and mentor engineers building Crossplane-based control planes, Kubernetes operators, and ArgoCD-driven GitOps automation
Design, implement, and optimize MCP-based infrastructure servers that contextualize and manage infrastructure and application state across environments
Contribute to CNCF open-source projects and represent Apple in the cloud-native community
Implement observability, governance, and automation frameworks to ensure performance, reliability, security, and compliance
Integrate agentic orchestration workflows for self-service provisioning, ML pipeline management, and dynamic infrastructure scaling
Drive best practices for GitOps, Infrastructure-as-Code, and Kubernetes cluster lifecycle automation at global scale
Ensure systems are resilient, cost-efficient, and optimized for performance across on-prem and multi-cloud environments

Qualification

KubernetesGolangPythonCrossplaneArgoCDModel Context ProtocolDistributed systemsInfrastructure-as-CodeObservabilityTechnical writingCross-functional leadership

Required

BS/MS in Computer Science or equivalent practical experience
5+ years of experience in leading distributed systems or cloud infrastructure engineering
Strong programming experience in Golang and Python, including building controllers, operators, or automation systems
Deep understanding of Kubernetes internals, controller-runtime, and Crossplane composition frameworks
Experience with ArgoCD, Helm, and IaC (Terraform or Crossplane)
Hands-on experience with GitOps and reconciliation-driven workflows
Proven ability to design and operate infrastructure for ML training and inference, including performance tuning and GPU optimization
Experience leading technical teams and driving architectural decisions
Strong grounding in cost efficiency, performance profiling, and system-level debugging

Preferred

9+ years in cloud infrastructure, SRE, or distributed systems roles
Contributions to CNCF open-source projects (Kubernetes, Crossplane, ArgoCD, Envoy, Prometheus, etc.)
Deep expertise in Kubernetes API machinery, CRDs, and control plane development
Experience with Model Context Protocol (MCP) or contextual infrastructure servers
Familiarity with AIOps or agentic/LLM-driven automation in production environments
Strong understanding of observability and distributed tracing (OpenTelemetry, Prometheus, Grafana)
Experience building ML infrastructure platforms (training clusters, inference systems, model registries)
Excellent communication, cross-functional leadership, and technical writing skills
B.S., M.S., or Ph.D. in Computer Science, Computer Engineering, or equivalent practical experience is preferred

Company

Apple is a technology company that designs, manufactures, and markets consumer electronics, personal computers, and software.

H1B Sponsorship

Apple has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (6998)
2024 (3766)
2023 (3939)
2022 (4822)
2021 (4060)
2020 (3656)

Funding

Current Stage
Public Company
Total Funding
$5.67B
Key Investors
Berkshire HathawayMicrosoftSequoia Capital
2025-05-05Post Ipo Debt· $4.5B
2025-01-16Post Ipo Debt· $0.31M
2021-04-30Post Ipo Equity

Leadership Team

leader-logo
Tim Cook
CEO
leader-logo
Craig Federighi
SVP, Software Engineering
Company data provided by crunchbase