GPU Software Architecture Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Apple · 2 months ago

GPU Software Architecture Engineer

Apple is seeking a senior/principal engineer to lead server-side ML acceleration and multi-node distribution initiatives. The role involves architecting and building next-generation distributed ML infrastructure, optimizing performance for real-time user experiences, and collaborating with silicon architects to influence future GPU designs.

AppsArtificial Intelligence (AI)BroadcastingDigital EntertainmentFoundational AIMedia and EntertainmentMobile DevicesOperating SystemsTVWearables
check
Comp. & Benefits
check
H1B Sponsor Likelynote

Responsibilities

Design and implement tensor/data/expert parallelism strategies for large language model inference across distributed server cluster environments
Drive hardware and software roadmap decisions for ML acceleration
Expert in designing architectures that achieves peak compute utilizations and optimal memory throughput
Develop and optimize distributed inference systems with focus on latency, throughput, and resource efficiency across multiple nodes
Architect scalable ML serving infrastructure supporting dynamic model sharding, load balancing, and fault tolerance
Collaborate with hardware teams on next-generation accelerator requirements and software teams on framework integration
Lead performance analysis and optimization of ML workloads, identifying bottlenecks in compute, memory, and network subsystems
Drive adoption of advanced parallelization techniques including pipeline parallelism, expert parallelism, and various other emerging approaches

Qualification

GPU programmingHigh-performance computingDistributed systemsParallel computing architecturesC/C++ programmingInter-node communicationTensor frameworksPythonModel development lifecycleML infrastructure

Required

Strong knowledge of GPU programming (CUDA, ROCm) and high-performance computing
Must have excellent system programming skills in C/C++, Python is a plus
Deep understanding of distributed systems and parallel computing architectures
Experience with inter-node communication technologies (InfiniBand, RDMA, NCCL) in the context of ML training/inference
Understand how tensor frameworks (PyTorch, JAX, TensorFlow) are used in distributed training/inference
Technical BS/MS degree

Preferred

Familiar with model development lifecycle from trained model to large scale production inference deployment
Proven track record in ML infrastructure at scale

Benefits

Comprehensive medical and dental coverage
Retirement benefits
A range of discounted products and free services
Reimbursement for certain educational expenses — including tuition
Discretionary bonuses or commission payments
Relocation

Company

Apple is a technology company that designs, manufactures, and markets consumer electronics, personal computers, and software.

H1B Sponsorship

Apple has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (6998)
2024 (3766)
2023 (3939)
2022 (4822)
2021 (4060)
2020 (3656)

Funding

Current Stage
Public Company
Total Funding
$5.67B
Key Investors
Berkshire HathawayMicrosoftSequoia Capital
2025-05-05Post Ipo Debt· $4.5B
2025-01-16Post Ipo Debt· $0.31M
2021-04-30Post Ipo Equity

Leadership Team

leader-logo
Tim Cook
CEO
leader-logo
Craig Federighi
SVP, Software Engineering
Company data provided by crunchbase