Apple · 1 week ago
AIML - ML Infrastructure Engineer, ML Platform & Technology - ML Compute
Apple is a company that values individual imaginations and innovation. They are seeking a Senior/Staff Engineer to design and scale the scheduling and orchestration systems for large-scale foundation model training and inference workloads across TPU clusters, driving innovations in resource management and platform reliability.
AppsArtificial Intelligence (AI)BroadcastingDigital EntertainmentFoundational AIMedia and EntertainmentMobile DevicesOperating SystemsTVWearables
Responsibilities
Lead the design and evolution of the scheduling platform that manages large-scale TPU workloads across multi-region clusters, supporting both training and inference
Develop topology-aware and quota-aware schedulers to improve cluster utilization, job latency, and fairness
Collaborate with Apple Foundation Model team to integrate advanced distributed computing frameworks (Pathways, Ray, Beam, JetStream) into the platform or expose them as reliable, scalable services
Automate complex operational workflows for quota updates, job lifecycle management, and resource provisioning to reduce on-call and dev-ops overhead
Mentor engineers and partner across teams to influence the direction of Apple’s large-scale distributed compute strategy
Qualification
Required
Bachelors in Computer Science, engineering, or a related field
Experience with foundation model training and inference workloads across TPU clusters
5+ years of hands-on experience in building scalable backend systems for training and evaluation of machine learning models
Proficient in relevant programming languages, like Python or Go
Strong expertise in distributed systems, reliability and scalability, containerization, and cloud platforms
Proficient in cloud computing infrastructure and tools: Kubernetes, Ray, PySpark
Ability to clearly and concisely communicate technical and architectural problems, while working with partners to iteratively find
Preferred
Advance degrees in Computer Science, engineering, or a related field
Proficient in working with and debugging accelerators, like: GPU, TPU, AWS Trainium
Proficient in ML training and deployment frameworks, like: JAX, Tensorflow, PyTorch, TensorRT, vLLM
Benefits
Comprehensive medical and dental coverage
Retirement benefits
A range of discounted products and free services
Reimbursement for certain educational expenses — including tuition
Discretionary bonuses or commission payments
Relocation
Company
Apple
Apple is a technology company that designs, manufactures, and markets consumer electronics, personal computers, and software.
H1B Sponsorship
Apple has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (6998)
2024 (3766)
2023 (3939)
2022 (4822)
2021 (4060)
2020 (3656)
Funding
Current Stage
Public CompanyTotal Funding
$5.67BKey Investors
Berkshire HathawayMicrosoftSequoia Capital
2025-05-05Post Ipo Debt· $4.5B
2025-01-16Post Ipo Debt· $0.31M
2021-04-30Post Ipo Equity
Leadership Team
Tim Cook
CEO
Craig Federighi
SVP, Software Engineering
Recent News
Venrock
2025-12-01
2025-09-25
Mac Daily News
2025-09-25
Company data provided by crunchbase