Sonatus · 3 months ago
Staff DevOps/MLOps Engineer
Sonatus is transforming the automotive industry with AI-enabled software-defined vehicles. The company is looking for a highly experienced Staff DevOps & MLOps Engineer to architect, build, and scale its DevOps and MLOps platform, with responsibility for the full cloud CI/CD pipeline and machine learning model lifecycle.
Automotive · Cloud Data Services · Information Technology · Software
Responsibilities
Design and build the foundational, end-to-end DevOps and MLOps platform for our Generative AI systems, making critical decisions that span the evaluation, monitoring, and deployment of large language model-based systems
Implement the full DevOps and MLOps framework. You will build the CI/CD/CT (Continuous Integration/Delivery/Training) automation that takes models from experiment to production with velocity and reliability
Deploy, scale, and optimize our model serving infrastructure. You will manage GPU/NPU resources, minimize inference latency, and build robust monitoring to ensure our AI is always fast, accurate, and cost-effective
Create a single, cohesive set of best practices for the entire AI lifecycle. Your work will define how we handle model versioning, infrastructure as code, and production observability in one seamless system
Qualifications
Required
A seasoned engineer with 8+ years of experience building and scaling production-grade cloud services and systems, with a strong focus on DevOps, MLOps, and/or SRE
A 'systems thinker' with a demonstrated ability to architect end-to-end solutions and a deep understanding of the full CI/CD pipeline and machine learning lifecycle
Deep proficiency in Python and Infrastructure as Code (e.g., Terraform, Pulumi, etc.)
Experience with MLOps tools (e.g., MLflow, Kubeflow, Vertex AI) and production monitoring frameworks
Enforce reproducibility, approvals, audit trails, PII handling, model cards, and policy/compliance (e.g., privacy, evals, guardrails)
Experience with robust ML deployment systems (e.g., Kubeflow, MLflow, model servers like BentoML or TensorFlow Serving)
Hands-on experience with public cloud platforms (GCP, AWS, and/or Azure) and containerization/orchestration (Docker, Kubernetes)
Package, version, and deploy software modules and AI models (batch & online) with blue/green or canary rollouts; build feature & model registries, and automate retraining
Preferred
Experience with PyTorch, vLLM, and GPUs is a plus
Experience with tracking model and agentic drift is a plus
Experience with tuning serving stacks (GPU/CPU utilization, batching, quantization)
Direct experience building and operationalizing systems for LLMs, especially RAG pipelines, is a plus
Experience with vector databases (e.g., Pinecone, Weaviate) and embedding management from a deployment and scaling perspective is a plus
Benefits
Stock option plan
Health care plan (Medical, Dental & Vision)
Retirement plan (401k, IRA)
Life Insurance (Basic, Voluntary & AD&D)
Unlimited paid time off (Vacation, Sick & Public Holidays)
Family leave (Maternity, Paternity)
Flexible work arrangements
Free food & snacks in the office
Company
Sonatus
Sonatus provides in-vehicle and cloud software that enables automotive companies to achieve the full promise of software-defined vehicles.
H1B Sponsorship
Sonatus has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for your reference. (Data powered by US Department of Labor)
Trends of Total Sponsorships
2025 (18)
2024 (7)
2023 (6)
2022 (7)
2021 (5)
2020 (1)
Funding
Current Stage: Growth Stage
Total Funding: $110M
Key Investors: Foxconn Technology Group, Translink Capital
2022-12-07 · Corporate Round · $75M
2021-07-21 · Series A · $35M
Company data provided by Crunchbase