GMI Cloud · 10 hours ago
Senior Solution Architect – AI / GPU Cloud
GMI Cloud is a fast-growing AI infrastructure company backed by Headline VC and recognized as an NVIDIA Reference Platform Cloud Partner. The Senior Solution Architect will serve as the primary technical interface for enterprise and hyperscaler accounts, designing GPU-cloud and AI infrastructure solutions while guiding customers through deployment and ensuring successful delivery.
Responsibilities
Serve as the primary technical point-of-contact for enterprise and hyperscaler customers
Deeply understand customer AI/ML/HPC workloads, scaling requirements, and deployment models
Architect GPU clusters, storage, networking, and orchestration solutions tailored to customer needs
Lead Proof-of-Concepts, benchmarks, and workshops demonstrating performance, reliability, and scalability
Produce technical proposals, architecture diagrams, capacity plans, and cost/performance recommendations
Translate complex technical issues into clear actions for both engineering and business stakeholders
Guide customers through onboarding, cluster setup, performance tuning, and scaling
Partner with internal Infra, DC Ops, and Engineering teams to ensure smooth delivery and implementation
Identify optimization opportunities in customer workloads (GPU utilization, networking, scheduling, cost)
Act as a trusted advisor on GPU/AI infrastructure best practices, roadmap, and long-term planning
Maintain regular technical check-ins, capacity reviews, and performance reviews with customers
Gather customer feedback and collaborate with product/engineering to improve our platform
Qualification
Required
5–10+ years in cloud infrastructure, GPU cloud, HPC, AI/ML infrastructure, or data center engineering
Strong understanding of distributed training & inference architectures
Strong understanding of Kubernetes, Slurm, or other cluster/orchestration systems
Strong understanding of NVIDIA GPU stack (H100/H200/B200/GB200 or similar)
Strong understanding of InfiniBand / high-speed networking
Strong understanding of storage architectures for AI workloads
Experience working directly with enterprise or hyperscaler technical teams
Ability to simplify complex infra concepts for both technical and non-technical audiences
Strong communication, solution-design, and project coordination skills
Self-starter, ownership mindset, excellent follow-through
Comfortable working in a fast-moving, high-growth environment
Strong problem-solving and 'architect + advisor' mentality
Preferred
Hands-on with large-scale GPU deployments (multi-node, multi-cluster)
Exposure to hyperscaler capacity planning or AI infrastructure procurement teams
Experience with multi-region or global GPU deployments (US + APAC/Taiwan)
Company
GMI Cloud
GMI Cloud provides GPU cloud access for generative AI applications.
H1B Sponsorship
GMI Cloud has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (2)
Funding
Current Stage
Growth StageTotal Funding
$82MKey Investors
Headline Asia (formerly Infinity Ventures)Banpu NEXT
2024-10-29Series A· $15M
2024-10-29Debt Financing· $67M
2024-07-16Corporate Round
Recent News
Morningstar.com
2025-11-20
2025-11-19
Company data provided by crunchbase