NVIDIA · 5 hours ago
Solutions Architect, AI Hyperscalers
NVIDIA has been transforming computer graphics and accelerated computing for over 25 years, and they are currently seeking an AI/ML Solutions Architect focusing on Hyperscale customers and Cloud Service Providers. The role involves leading software customer technical engagement for AI training, inference, and infrastructure at scale, while ensuring successful deployments and providing innovative solutions.
AI InfrastructureArtificial Intelligence (AI)Consumer ElectronicsFoundational AIGPUHardwareSoftwareVirtual Reality
Responsibilities
As a key technical member of a focused account team, you will serve as the main point of contact for NVIDIA products, enabling internet giants and cloud providers to have an innovative AI/ML software infrastructure
Work directly with best-in-class engineering teams to secure design wins, address challenges, bring solutions to production, and support them throughout their lifecycle
Become a trusted advisor to your customer by understanding their environment, constraints, and long-term strategy. Translate these insights into product requirements and innovative solutions
Help your customer enhance the value of NVIDIA technology, and provide feedback to NVIDIA for future product improvements
Facilitate the resolution of customer issues, offering timely and proactive communications to mitigate risks
Lead workshops, demos, and proof-of-concepts to showcase NVIDIA’s AI/ML capabilities
Guide customers on standard processes for scalable AI model deployment and inference optimization
Qualification
Required
Minimum of a BS/MS in Computer Science, Electrical Engineering, or equivalent experience
4+ years of engineering experience with a proven track record in AI/ML-focused projects or enterprise-grade solutions
Proven understanding of Linux, including solving, optimization, and customization for AI/ML workloads
Strong understanding of data science and machine learning infrastructure—software and hardware
Professional-level communication skills, including the ability to tailor messages for varying technical audiences and maintain composure in high-pressure situations
Excellent follow-up and interpersonal skills, with a true passion for problem-solving
Proficient in Python, with the ability to develop scripts and build custom tools. Experience with parallel programming or GPU acceleration (e.g., CUDA) is helpful
Shown eagerness to learn and apply new technologies
Preferred
Experience with Chatbots, RAG pipelines, vector databases, and distributed training or inference workloads
Experience or background in HPC (High Performance Computing) environments for AI or ML applications
Familiarity with multi-node GPU clusters and performance tuning for large-scale AI workloads
Experience developing in cloud and/or virtualized environments, containerized solutions, with knowledge of Docker, Kubernetes
Background with common deep learning frameworks such as PyTorch or JAX
Benefits
Equity
Benefits
Company
NVIDIA
NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.
H1B Sponsorship
NVIDIA has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1877)
2024 (1355)
2023 (976)
2022 (835)
2021 (601)
2020 (529)
Funding
Current Stage
Public CompanyTotal Funding
$4.09BKey Investors
ARPA-EARK Investment ManagementSoftBank Vision Fund
2023-05-09Grant· $5M
2022-08-09Post Ipo Equity· $65M
2021-02-18Post Ipo Equity
Recent News
Company data provided by crunchbase