Qualcomm · 7 hours ago
MLOps Engineer - ML Platform
Qualcomm Technologies, Inc. is seeking a highly skilled and experienced Staff MLOps Engineer to contribute to the development and maintenance of their ML platform both on premises and AWS Cloud. The role involves architecting, deploying, and optimizing the ML platform that supports training of Machine Learning Models using advanced technologies and collaborating with cross-functional teams to ensure the smooth operation and scalability of the infrastructure.
Artificial Intelligence (AI)Generative AISoftwareTelecommunicationsWireless
Responsibilities
Architect, develop, and maintain the ML platform to support training and inference of ML models
Design and implement scalable and reliable infrastructure solutions for NVIDIA clusters both on premises and AWS Cloud
Collaborate with data scientists and software engineers to define requirements and ensure seamless integration of ML and Data workflows into the platform
Optimize the platform’s performance and scalability, considering factors such as GPU resource utilization, data ingestion, model training, and deployment
Monitor and troubleshoot system performance, identifying and resolving issues to ensure the availability and reliability of the ML platform
Implement and maintain CI/CD pipelines for automated model training, evaluation, and deployment using technologies like ArgoCD and Argo Workflow
Implement and maintain monitoring stack using Prometheus and Grafana to ensure the health and performance of the platform
Manage AWS services including EKS, EC2, VPC, IAM, S3, and EFS to support the platform
Implement logging and monitoring solutions using AWS CloudWatch and other relevant tools
Stay updated with the latest advancements in MLOps, distributed computing, and GPU acceleration technologies, and proactively propose improvements to enhance the ML platform
Qualification
Required
Bachelor's or Master's degree in Computer Science, Engineering, or a related field
Proven experience as an MLOps Engineer or similar role, with a focus on large-scale ML and/or Data infrastructure and GPU clusters
Strong expertise in configuring and optimizing NVIDIA DGX clusters for deep learning workloads
Proficient in using the Kubernetes platform, including technologies like Helm, ArgoCD, Argo Workflow, Prometheus, and Grafana
Solid programming skills in languages like Python, Go and experience with relevant ML frameworks (e.g., TensorFlow, PyTorch)
In-depth understanding of distributed computing, parallel computing, and GPU acceleration techniques
Familiarity with containerization technologies such as Docker and orchestration tools
Experience with CI/CD pipelines and automation tools for ML workflows (e.g., Jenkins, GitHub, ArgoCD)
Experience with AWS services such as EKS, EC2, VPC, IAM, S3, and EFS
Experience with AWS logging and monitoring tools
Strong problem-solving skills and the ability to troubleshoot complex technical issues
Excellent communication and collaboration skills to work effectively within a cross-functional team
Bachelor's degree in Engineering, Information Systems, Computer Science, or related field and 4+ years of Software Engineering or related work experience
OR Master's degree in Engineering, Information Systems, Computer Science, or related field and 3+ years of Software Engineering or related work experience
OR PhD in Engineering, Information Systems, Computer Science, or related field and 2+ years of Software Engineering or related work experience
2+ years of work experience with Programming Language such as C, C++, Java, Python, etc
Preferred
Experience with training and deploying models
Knowledge of ML model optimization techniques and memory management on GPUs
Familiarity with ML-specific data storage and retrieval systems
Understanding of security and compliance requirements in ML infrastructure
Benefits
Competitive annual discretionary bonus program
Opportunity for annual RSU grants
Highly competitive benefits package
Company
Qualcomm
Qualcomm designs wireless technologies and semiconductors that power connectivity, communication, and smart devices.
H1B Sponsorship
Qualcomm has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (2013)
2024 (1910)
2023 (3216)
2022 (2885)
2021 (2104)
2020 (1181)
Funding
Current Stage
Public CompanyTotal Funding
$3.5M1991-12-20IPO
1988-01-01Undisclosed· $3.5M
Recent News
KoreaTechToday - Korea's Leading Tech and Startup Media Platform
2026-01-13
BiometricUpdate.com
2026-01-12
Company data provided by crunchbase