SemiAnalysis · 1 week ago
GPU Cloud Architect
SemiAnalysis is an independent research and analysis firm specializing in the Semiconductor and AI industries. The GPU Cloud Architect will lead the development of GPU Cloud benchmarks and establish strategic partnerships within the neocloud and AI chip manufacturing sectors.
Artificial Intelligence (AI)ConsultingSemiconductor
Responsibilities
Lead the development of the next generation of our industry leading ClusterMAX™ GPU Cloud benchmark
Develop & operate dozens of GPU clusters including GB200 NVL72, TPUv7, H200, Mi355, etc
Develop GPU Cloud benchmarks such as Storage IO & bandwidth benchmarks, PyTorch MLPerf benchmarks, etc
Author detailed technical research reports analyzing benchmark results, GPU Cloud performance, scalability, & efficiency
Establish and maintain strategic partnerships & collaborations with over 50 leading neocloud providers & AI chip manufacturers, including AMD, NVIDIA, and other industry stakeholders
Qualification
Required
Experience working at a hyperscaler or a GPU cloud
1-2 years using GPU or TPU clusters and/or running a multi-tenant GPU cluster
Solid understanding of SLURM, & Kubernetes
Practical experience in InfiniBand, NCCL, Fabric Manager, PyTorch, etc
Strong research skills and the ability to synthesize information from various sources to draw insights
Benefits
Generous PTO
Office stipends
Competitive healthcare (medical, dental, vision)
Support for conferences and ongoing learning
Company
SemiAnalysis
SemiAnalysis offers AI and semiconductor research, consulting, and hosts tech events like Nvidia Blackwell GPU Hackathon.
H1B Sponsorship
SemiAnalysis has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)
Funding
Current Stage
Early StageRecent News
2025-11-08
Company data provided by crunchbase