Solutions Architect, Generative AI - Inference @ NVIDIA | Jobright.ai
JOBSarrow
RecommendedLiked
0
Applied
0
External
0
Solutions Architect, Generative AI - Inference jobs in Washington, United States
Be an early applicantLess than 25 applicants
company-logo

NVIDIA · 3 hours ago

Solutions Architect, Generative AI - Inference

ftfMaximize your interview chances
Artificial Intelligence (AI)GPU
check
Growth Opportunities
check
H1B Sponsor Likelynote

Insider Connection @NVIDIA

Discover valuable connections within the company who might provide insights and potential referrals.
Get 3x more responses when you reach out via email instead of LinkedIn.

Responsibilities

Partnering with other solution architects, engineering, product and business teams. Understanding their strategies and technical needs and helping define high-value solutions
Dynamically engaging with developers, scientific researchers, data scientists, which will give you experience across a range of technical areas
Strategically partnering with lighthouse customers and industry-specific solution partners targeting our computing platform
Working closely with customers to help them adopt and build solutions using NVIDIA technology
Analyze performance and power efficiency of deep learning inference workloads
Some travel to conferences and customers may be required

Qualification

Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.

Deep LearningPyTorchTensorFlowLarge Language ModelsPythonNVIDIA GPUsNVIDIA NeMo FrameworkNVIDIA Triton Inference ServerTensorRTC/C++DebuggingPerformance AnalysisSoftware DesignParallel ProgrammingDistributed ComputingCollaboration Skills

Required

BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering or related fields (or equivalent experience)
5+ years of hands-on experience with Deep Learning frameworks such as PyTorch and TensorFlow
Strong fundamentals in programming, optimizations and software design, especially in Python
Strong problem-solving and debugging skills
Excellent knowledge of theory and practice of Large Language Models and Deep Learning inference
Excellent presentation, communication and collaboration skills
Desire to be involved in multiple diverse and creative projects

Preferred

Experience with NVIDIA GPUs and software libraries, such as NVIDIA NeMo Framework, NVIDIA Triton Inference Server, TensorRT, TensorRT-LLM
Excellent C/C++ programming skills, including debugging, profiling, code optimization, performance analysis, and test design
Familiarity with parallel programming and distributed computing platforms
Prior experience with DL training at scale, deploying or optimizing DL inference in production

Benefits

Equity
Benefits

Company

NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.

H1B Sponsorship

NVIDIA has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2023 (735)
2022 (892)
2021 (696)
2020 (534)

Funding

Current Stage
Public Company
Total Funding
$4.09B
Key Investors
ARPA-EARK Investment ManagementSoftBank Vision Fund
2023-05-09Grant· $5M
2022-08-09Post Ipo Equity· $65M
2021-02-18Post Ipo Equity· Undisclosed

Leadership Team

leader-logo
Jensen Huang
CEO and Founder
linkedin
leader-logo
Chris Malachowsky
Co-Founder, SVP
linkedin
Company data provided by crunchbase
logo

Orion

Your AI Copilot