Sailplane ยท 14 hours ago
Senior ML Engineer
Sailplane is an early-stage AI infrastructure startup focused on building a self-driving cloud for autonomous management of AI data centers. The Senior ML Engineer will lead the build and operations of LLMs in production, ensuring high-quality performance and collaboration across teams.
Artificial Intelligence (AI)Information TechnologyMachine Learning
Responsibilities
Build, deploy, monitor, and operate LLMs in production on-premises in diverse customer environments
Implement MLOps best practices (CI/CD pipelines, containerization, continuous monitoring) to ensure reliable performance
Benchmark performance and recommend solutions to improve customer deployments including hardware sizing for target throughput (tokens per second, concurrent user sessions)
Experiment and iterate on models by tuning parameters and testing new approaches, continuously improving accuracy and effectiveness through rigorous evaluation
Document and ensure reproducibility of ML work, track experiments, code, and model versions to foster knowledge sharing and maintain high standards in the team
Collaborate cross-functionally with software engineers, customers, and product stakeholders
Qualification
Required
5+ years of experience in ML engineering, preferably in a VC-backed startup environment
Hands-on experience deploying models at scale, including familiarity with containerization (Docker, Kubernetes) and cloud platforms (AWS, GCP, or Azure) to build and operate ML systems in production
Experience with Prometheus, Grafana, distributed tracing, or ML-specific monitoring (Weights & Biases, MLflow for production)
Proficiency in programming (especially Python) and experience with modern ML frameworks/libraries such as TensorFlow, PyTorch, etc
Deep understanding of machine learning algorithms and the model development lifecycle (data preprocessing, training, parameter tuning, and evaluation)
Proven track record of delivering software that creates real value for users
Excellent communication skills with an ability to explain complex ML concepts to non-experts, and a collaborative approach to working with cross-functional teams and partners
Fluency in models
Adept at integrating production infrastructure and observability
Lead performance benchmarking
Comfortable working in code and diverse production environments
Strong focus on correctness and quality
Preferred
Experience in a VC-backed startup environment
Familiarity with containerization and cloud platforms
Experience with ML-specific monitoring tools
Benefits
Comprehensive Health, Dental, and Vision coverage beginning on the first day for employees and their families, paid 100% by Sailplane.
Equity grant participation.
Flexible PTO with no accrual or set annual cap, plus 15 paid holidays per year.
Health and Wellness stipend ($3,000 annually) to help support your personal health goals.
AI tools stipend ($1,200 annually) to encourage hands-on familiarity with emerging tools.
12 weeks of paid parental leave.
Company
Sailplane
Sailplane is a technological company focused on Hierarchical Planning AI research.
Funding
Current Stage
Early StageTotal Funding
unknown2023-10-01Seed
Company data provided by crunchbase