Machine Learning Engineer, GeForce G-Assist jobs in United States
cer-icon
Apply on Employer Site
company-logo

NVIDIA · 1 day ago

Machine Learning Engineer, GeForce G-Assist

NVIDIA is building GeForce G-Assist, an on-device AI assistant that integrates Small Language Models and hybrid cloud capabilities. The role involves evaluating and improving AI models, optimizing performance, and designing systems for contextual assistance within the GeForce ecosystem.

AI InfrastructureArtificial Intelligence (AI)Consumer ElectronicsFoundational AIGPUHardwareSoftwareVirtual Reality
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Evaluate and improve Small Language Models used in GeForce G-Assist, with an emphasis on accuracy, robustness, and conversational reliability
Identify and mitigate conversation and context contamination, including state drift, prompt leakage, and retrieval cross-talk
Work with SLM and VLM architectures to support text and multimodal interactions
Collaborate on hybrid architectures that combine local SLMs with cloud-based models
Optimize local inference using llama.cpp, including quantization, memory usage, and performance tuning
Read, write, and optimize C/C++ code in performance-critical paths
Design and integrate retrieval-augmented generation (RAG) systems that ground responses in system and user context
Support agentic AI workflows, enabling planning, tool use, and multi-step execution

Qualification

C/C++ programmingPythonSmall Language ModelsLlama.cppRetrieval technologiesVLM architecturesUser feedback translationCollaborationProblem-solving

Required

8+ years of validated experience in system software or a related field, with an M.S. or higher degree in Computer Science, Data Science, Engineering, or a related field (or equivalent experience)
Strong ability to read and write C/C++ code in systems-level or performance-sensitive environments, along with proficiency in Python
Hands-on experience with llama.cpp or similar local inference frameworks
Hands-on experience evaluating Small Language Models, including task-based and conversational testing, with an understanding of conversation dynamics, long-context behavior, and contamination challenges
Knowledge of SLM and VLM architectures and their trade-offs, experience with retrieval technologies and language-model integration, and familiarity with agentic AI patterns such as tool use and planning

Preferred

Experience contributing to language or multimodal models that power user-facing products, features, or workflows
A track record of collaborating with product, platform, or systems teams to balance model capability, performance, and user experience
Demonstrated ability to translate user needs or feedback into measurable improvements in model behavior or system reliability

Benefits

Equity
Benefits

Company

NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.

H1B Sponsorship

NVIDIA has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1877)
2024 (1355)
2023 (976)
2022 (835)
2021 (601)
2020 (529)

Funding

Current Stage
Public Company
Total Funding
$4.09B
Key Investors
ARPA-EARK Investment ManagementSoftBank Vision Fund
2023-05-09Grant· $5M
2022-08-09Post Ipo Equity· $65M
2021-02-18Post Ipo Equity

Leadership Team

leader-logo
Jensen Huang
Founder and CEO
linkedin
leader-logo
Michael Kagan
Chief Technology Officer
linkedin
Company data provided by crunchbase