Cerebro ยท 4 days ago
Founding ML inference Engineer
Cerebro is a stealth-mode, VC-backed AI startup focused on developing cutting-edge machine learning infrastructure. They are seeking a Founding ML Inference Engineer to design high-performance model serving systems and build distributed training infrastructure for next-generation diffusion LLMs.
Responsibilities
Design and optimize high-performance model serving systems for ultra-low-latency inference of next-generation diffusion LLMs
Build distributed training infrastructure supporting large-scale, state-of-the-art machine learning
Develop observability and monitoring solutions to keep ML systems robust in production environments
Optimize infrastructure costs and resource utilization across powerful GPU clusters
Create efficient data storage and retrieval systems tailored for ML workloads
Collaborate with world-class ML researchers to translate theoretical advances into practical systems
Qualification
Required
BS/MS/PhD in Computer Science, Engineering, or equivalent experience
Knowledge of ML serving frameworks like vLLM, TensorRT, or ONNX Runtime
Deep understanding of ML frameworks (PyTorch, TensorFlow) from a systems perspective
Familiarity with high-performance computing, GPU programming (CUDA), and distributed training (data/model/pipeline parallelism)
Strong coding skills in Python and at least one systems language (C++, Rust, or Go)
Experience with containerization (Docker), orchestration (Kubernetes), and CI/CD
Preferred
Building and maintaining large-scale ML training clusters
Experience with cloud platforms (AWS, GCP, Azure) and distributed systems
Familiarity with ML workflow orchestration tools (Kubeflow, Airflow)
Skills in performance optimization, profiling, and MLOps practices
Benefits
Health
Dental
Vision
PTO
Flexible vacation
Company
Cerebro
Cerebro partners with companies backed by some of the worlds most prominent VC funds (a16z, Andreessen Horowitz, Sequoia) and accelerator programs including a significant portion of the Y Combinator portfolio to solve startup's most critical hiring challenges across AI, ML & robotics
Funding
Current Stage
Early StageCompany data provided by crunchbase