Senior AI Platform Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Qualcomm · 4 months ago

Senior AI Platform Engineer

Qualcomm Incorporated is seeking a Senior AI Platforms Engineer to design, build, and operate infrastructure for large-scale AI and ML workloads, focusing on LLM hosting and serving. This role involves collaborating with global teams to ensure secure, reliable, and cost-efficient AI platforms while leveraging expertise in Kubernetes and multi-cloud environments.

Artificial Intelligence (AI)Generative AISoftwareTelecommunicationsWireless
check
Comp. & Benefits
check
H1B Sponsor Likelynote

Responsibilities

Deploy and manage large language models (LLMs) at scale using AWS Bedrock, GCP Vertex, Azure AI Foundry and Kubernetes-based solutions
Optimize inference performance for throughput, latency, and cost efficiency
Build and maintain Kubernetes clusters for AI workloads with GPU scheduling, autoscaling, and high availability
Model and deploy auto scaling applications and APIs to existing Kubernetes clusters
Implement CI/CD pipelines and Infrastructure as Code (Terraform, Helm)
Design and implement observability stacks for large-scale systems, including metrics, logs, and traces
Manage large-scale search systems built on elasticsearch powering hybrid-search solutions
Deploy and scale agentic workflow orchestration systems (e.g., n8n) for AI-driven automation
Ensure reliability, security, and performance of workflow execution at scale
Operate across AWS, GCP, and Azure, leveraging managed AI services and GPU infrastructure
Work closely with globally distributed teams; provide documentation, mentorship, and participate in on-call rotations

Qualification

KubernetesMLOpsLLM hostingMulti-cloud environmentsPythonLinux systems administrationAgentic workflow systemsNetworkingSecurityObservabilityCommunicationCollaboration

Required

5–7 years of experience in platform engineering, MLOps, or SRE roles
Strong hands-on experience with Kubernetes (production-grade deployments, autoscaling, GPU scheduling)
Strong hands-on experience with Cloud platforms: AWS (Bedrock, SageMaker), plus GCP and/or Azure
Strong hands-on experience with Python and scripting languages (Bash, PowerShell)
Strong hands-on experience with Linux systems administration
Proven experience hosting and serving LLMs at scale in production environments
Expertise in observability: Elasticsearch, Prometheus, Grafana, OpenTelemetry
Familiarity with agentic workflow systems (e.g., n8n) and scaling them for enterprise use
Strong understanding of networking, security, and IAM in cloud-native environments
Excellent communication skills and ability to work with global teams
Bachelor's degree in Engineering, Information Systems, Computer Science, or related field and 2+ years of Software Engineering or related work experience
OR Master's degree in Engineering, Information Systems, Computer Science, or related field and 1+ year of Software Engineering or related work experience
OR PhD in Engineering, Information Systems, Computer Science, or related field
2+ years of academic or work experience with Programming Language such as C, C++, Java, Python, etc

Preferred

Experience with model serving frameworks (vLLM, Triton, KServe, Ray Serve)
Knowledge of vector databases (Elasticsearch vector, Milvus, Pinecone) for RAG workflows
Familiarity with service mesh (Istio/Linkerd), policy-as-code (OPA/Gatekeeper)
GPU optimization for inference workloads
Certifications: AWS Solutions Architect or ML Specialty, CKA/CKAD

Benefits

Competitive annual discretionary bonus program
Opportunity for annual RSU grants
Highly competitive benefits package designed to support your success at work, at home, and at play

Company

Qualcomm

company-logo
Qualcomm designs wireless technologies and semiconductors that power connectivity, communication, and smart devices.

H1B Sponsorship

Qualcomm has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (2013)
2024 (1910)
2023 (3216)
2022 (2885)
2021 (2104)
2020 (1181)

Funding

Current Stage
Public Company
Total Funding
$3.5M
1991-12-20IPO
1988-01-01Undisclosed· $3.5M

Leadership Team

leader-logo
Cristiano Amon
President and Chief Executive Officer
linkedin
I
Isaac Eteminan
CEO
linkedin
Company data provided by crunchbase