Qualcomm · 4 days ago
Senior AI Platform Engineer
Qualcomm Incorporated is seeking a Senior AI Platforms Engineer to design, build, and operate the infrastructure that powers large-scale AI and ML workloads. This role focuses on LLM hosting and serving at scale, requiring expertise in Kubernetes, multi-cloud environments, and observability systems while collaborating with global teams to deliver secure and reliable AI platforms.
Artificial Intelligence (AI)Generative AISoftwareTelecommunicationsWireless
Responsibilities
Deploy and manage large language models (LLMs) at scale using AWS Bedrock, GCP Vertex, Azure AI Foundry and Kubernetes-based solutions
Optimize inference performance for throughput, latency, and cost efficiency
Build and maintain Kubernetes clusters for AI workloads with GPU scheduling, autoscaling, and high availability
Model and deploy auto scaling applications and APIs to existing Kubernetes clusters
Implement CI/CD pipelines and Infrastructure as Code (Terraform, Helm)
Design and implement observability stacks for large-scale systems, including metrics, logs, and traces
Manage large-scale search systems built on elasticsearch powering hybrid-search solutions
Deploy and scale agentic workflow orchestration systems (e.g., n8n) for AI-driven automation
Ensure reliability, security, and performance of workflow execution at scale
Operate across AWS, GCP, and Azure, leveraging managed AI services and GPU infrastructure
Work closely with globally distributed teams; provide documentation, mentorship, and participate in on-call rotations
Qualification
Required
5–7 years of experience in platform engineering, MLOps, or SRE roles
Strong hands-on experience with:
Kubernetes (production-grade deployments, autoscaling, GPU scheduling)
Cloud platforms: AWS (Bedrock, SageMaker), plus GCP and/or Azure
Python and scripting languages (Bash, PowerShell)
Linux systems administration
Proven experience hosting and serving LLMs at scale in production environments
Expertise in observability: Elasticsearch, Prometheus, Grafana, OpenTelemetry
Familiarity with agentic workflow systems (e.g., n8n) and scaling them for enterprise use
Strong understanding of networking, security, and IAM in cloud-native environments
Excellent communication skills and ability to work with global teams
Bachelor's degree in Engineering, Information Systems, Computer Science, or related field and 2+ years of Software Engineering or related work experience
OR Master's degree in Engineering, Information Systems, Computer Science, or related field and 1+ year of Software Engineering or related work experience
OR PhD in Engineering, Information Systems, Computer Science, or related field
2+ years of academic or work experience with Programming Language such as C, C++, Java, Python, etc
Preferred
Experience with model serving frameworks (vLLM, Triton, KServe, Ray Serve)
Knowledge of vector databases (Elasticsearch vector, Milvus, Pinecone) for RAG workflows
Familiarity with service mesh (Istio/Linkerd), policy-as-code (OPA/Gatekeeper)
GPU optimization for inference workloads
Certifications: AWS Solutions Architect or ML Specialty, CKA/CKAD
Benefits
Competitive annual discretionary bonus program
Opportunity for annual RSU grants
Highly competitive benefits package
Company
Qualcomm
Qualcomm designs wireless technologies and semiconductors that power connectivity, communication, and smart devices.
H1B Sponsorship
Qualcomm has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (2013)
2024 (1910)
2023 (3216)
2022 (2885)
2021 (2104)
2020 (1181)
Funding
Current Stage
Public CompanyTotal Funding
$3.5M1991-12-20IPO
1988-01-01Undisclosed· $3.5M
Recent News
2026-01-07
2026-01-07
2026-01-07
Company data provided by crunchbase