Principal Engineer – LLM Serving (Cloud AI) jobs in United States
cer-icon
Apply on Employer Site
company-logo

Qualcomm · 1 month ago

Principal Engineer – LLM Serving (Cloud AI)

Qualcomm Technologies, Inc. is developing innovative software solutions for Inference Acceleration as part of their Cloud AI team. They are looking for a Principal Engineer to oversee the entire product life cycle from R&D to deployment, focusing on large commercial software projects and requiring strong skills in machine learning and software performance optimization.

Artificial Intelligence (AI)Generative AISoftwareTelecommunicationsWireless
check
Comp. & Benefits
check
H1B Sponsor Likelynote

Responsibilities

Proven ability of planning, managing and deliver large commercial software projects
Experience in serving frameworks, like vLLM
Strong development skills in PyTorch
Strong understanding of LLMs, Multi-modal and reasoning models
Experience in executing, analyzing, and optimizing neural networks
Experience in writing high performance software for multicore systems
Experience with C++, Python
Strong skills in analyzing performance of software/hardware solutions on multi-core architectures; understanding of multi-core architecture fundamentals (core, cache, memory, bus, PCIe, etc)
Understanding of multi-core processor architecture and SoC architectures (NoCs, caches, memories, etc.)
Experience with Performance modeling of SoC architectures
Excellent communication skills (written and verbal) and team player
Experience with machine learning accelerators and related software is highly desired
Background and understanding of neural network operators and mathematical operations: linear algebra, math libraries, desirable

Qualification

LLM Serving frameworksPyTorchNeural network optimizationC++Performance modelingMulti-core architectureSoftware performance analysisMachine learning acceleratorsLinear algebraCommunication skillsTeam player

Required

Proven ability of planning, managing and deliver large commercial software projects
Experience in serving frameworks, like vLLM
Strong development skills in PyTorch
Strong understanding of LLMs, Multi-modal and reasoning models
Experience in executing, analyzing, and optimizing neural networks
Experience in writing high performance software for multicore systems
Experience with C++, Python
Strong skills in analyzing performance of software/hardware solutions on multi-core architectures; understanding of multi-core architecture fundamentals (core, cache, memory, bus, PCIe, etc)
Understanding of multi-core processor architecture and SoC architectures (NoCs, caches, memories, etc.)
Experience with Performance modeling of SoC architectures
Excellent communication skills (written and verbal) and team player
Master's, Computer Engineering and/or Computer Networks & Systems and/or Computer Science
Bachelor's degree in Computer Science, Engineering, Information Systems, or related field and 8+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience
Master's degree in Computer Science, Engineering, Information Systems, or related field and 7+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience
PhD in Computer Science, Engineering, Information Systems, or related field and 6+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience

Preferred

Experience with machine learning accelerators and related software is highly desired
Background and understanding of neural network operators and mathematical operations: linear algebra, math libraries, desirable

Benefits

Competitive annual discretionary bonus program
Opportunity for annual RSU grants
Highly competitive benefits package

Company

Qualcomm

company-logo
Qualcomm designs wireless technologies and semiconductors that power connectivity, communication, and smart devices.

H1B Sponsorship

Qualcomm has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (2013)
2024 (1910)
2023 (3216)
2022 (2885)
2021 (2104)
2020 (1181)

Funding

Current Stage
Public Company
Total Funding
$3.5M
1991-12-20IPO
1988-01-01Undisclosed· $3.5M

Leadership Team

leader-logo
Cristiano Amon
President and Chief Executive Officer
linkedin
I
Isaac Eteminan
CEO
linkedin
Company data provided by crunchbase