Qualcomm · 6 hours ago
AI Performance Engineer (Cloud AI Engineering), Sr | Staff | Sr. Staff
Qualcomm Technologies, Inc. is drawing on its traditional strengths in digital wireless technologies to play a central role in the evolution of Cloud AI. The company is seeking an AI Performance Engineer to join its team, with a focus on optimizing models for efficient inference and collaborating across teams to address performance challenges.
Artificial Intelligence (AI) · Generative AI · Software · Telecommunications · Wireless
Responsibilities
Convert, optimize and deploy models for efficient inference using PyTorch and ONNX
Work at the forefront of GenAI by understanding advanced algorithms (e.g. attention mechanisms, MoEs) and numerics to identify new optimization opportunities
Analyze and optimize the performance of LLM, VLM, and diffusion models for inference
Scale performance for throughput and latency constraints
Map next-generation AI workloads onto current and future hardware designs
Work closely with customers to drive solutions by collaborating with internal compiler, firmware and platform teams
Analyze complex performance or stability issues and work toward the root cause of the underlying problems
Create engineering solutions that deliver continuous insight into the performance of AI workloads and guide improvements over time
Design and implement high-level kernels, e.g. in Triton, with a focus on generating efficient, low-level code
Qualifications
Required
Hands-on experience in building and optimizing language models, notably in PyTorch and ONNX, preferably in production-grade environments
Deep understanding of transformer architectures, attention mechanisms and performance trade-offs
Experience with workload mapping strategies such as sharding and various forms of parallelism
Strong Python programming skills
Proactive learning about the latest inference optimization techniques
Understanding of computer architecture, ML accelerators, in-memory processing and distributed systems
Strong communication, problem-solving skills and ability to learn and work effectively in a fast-paced and collaborative environment
MS in Computer Science, Machine Learning, Computer Engineering or Electrical Engineering
Bachelor's degree in Computer Science, Engineering, Information Systems, or related field and 6+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience
Master's degree in Computer Science, Engineering, Information Systems, or related field and 5+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience
PhD in Computer Science, Engineering, Information Systems, or related field and 4+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience
Preferred
Background in neural network operators and mathematical operations, including linear algebra and math libraries
Understanding of machine learning compilers
Experience with accuracy convergence and its evaluation methods
Knowledge of torch.compile or TorchDynamo
PhD in Computer Science, Computer Engineering or Machine Learning
Benefits
Competitive annual discretionary bonus program
Opportunity for annual RSU grants
Highly competitive benefits package
Company
Qualcomm
Qualcomm designs wireless technologies and semiconductors that power connectivity, communication, and smart devices.
H1B Sponsorship
Qualcomm has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is presented below for your reference. (Data Powered by US Department of Labor)
Trends of Total Sponsorships
2025 (2013)
2024 (1910)
2023 (3216)
2022 (2885)
2021 (2104)
2020 (1181)
Funding
Current Stage
Public Company
Total Funding
$3.5M
1991-12-20 · IPO
1988-01-01 · Undisclosed · $3.5M
Company data provided by Crunchbase