Be an early applicantLess than 25 applicants

Company

NVIDIA · 3 hours ago

Solutions Architect, Generative AI - Inference

Washington, United States

Full-time

Remote

Senior Level

$148K/yr - $230K/yr

5+ years exp

Maximize your interview chances

Artificial Intelligence (AI)GPU

Growth Opportunities

H1B Sponsor Likely

Insider Connection @NVIDIA

Discover valuable connections within the company who might provide insights and potential referrals.
Get 3x more responses when you reach out via email instead of LinkedIn.

Responsibilities

Partnering with other solution architects, engineering, product and business teams. Understanding their strategies and technical needs and helping define high-value solutions

Dynamically engaging with developers, scientific researchers, data scientists, which will give you experience across a range of technical areas

Strategically partnering with lighthouse customers and industry-specific solution partners targeting our computing platform

Working closely with customers to help them adopt and build solutions using NVIDIA technology

Analyze performance and power efficiency of deep learning inference workloads

Some travel to conferences and customers may be required

Qualification

Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.

Deep LearningPyTorchTensorFlowLarge Language ModelsPythonNVIDIA GPUsNVIDIA NeMo FrameworkNVIDIA Triton Inference ServerTensorRTC/C++DebuggingPerformance AnalysisSoftware DesignParallel ProgrammingDistributed ComputingCollaboration Skills

Required

BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering or related fields (or equivalent experience)

5+ years of hands-on experience with Deep Learning frameworks such as PyTorch and TensorFlow

Strong fundamentals in programming, optimizations and software design, especially in Python

Strong problem-solving and debugging skills

Excellent knowledge of theory and practice of Large Language Models and Deep Learning inference

Excellent presentation, communication and collaboration skills

Desire to be involved in multiple diverse and creative projects

Preferred

Experience with NVIDIA GPUs and software libraries, such as NVIDIA NeMo Framework, NVIDIA Triton Inference Server, TensorRT, TensorRT-LLM

Excellent C/C++ programming skills, including debugging, profiling, code optimization, performance analysis, and test design

Familiarity with parallel programming and distributed computing platforms

Prior experience with DL training at scale, deploying or optimizing DL inference in production

Benefits

Equity

Benefits

Company

NVIDIA

Glassdoor

4.6

NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.

Founded in 1993

Santa Clara, California, USA

10,001+ employees

https://www.nvidia.com

H1B Sponsorship

NVIDIA has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)

Distribution of Different Job Fields Receiving Sponsorship

Represents job field similar to this job

Trends of Total Sponsorships

2023 (735)

2022 (892)

2021 (696)

2020 (534)