Senior ML Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Sailplane ยท 14 hours ago

Senior ML Engineer

Sailplane is an early-stage AI infrastructure startup focused on building a self-driving cloud for autonomous management of AI data centers. The Senior ML Engineer will lead the build and operations of LLMs in production, ensuring high-quality performance and collaboration across teams.

Artificial Intelligence (AI)Information TechnologyMachine Learning

Responsibilities

Build, deploy, monitor, and operate LLMs in production on-premises in diverse customer environments
Implement MLOps best practices (CI/CD pipelines, containerization, continuous monitoring) to ensure reliable performance
Benchmark performance and recommend solutions to improve customer deployments including hardware sizing for target throughput (tokens per second, concurrent user sessions)
Experiment and iterate on models by tuning parameters and testing new approaches, continuously improving accuracy and effectiveness through rigorous evaluation
Document and ensure reproducibility of ML work, track experiments, code, and model versions to foster knowledge sharing and maintain high standards in the team
Collaborate cross-functionally with software engineers, customers, and product stakeholders

Qualification

ML engineering experienceModel deployment at scaleContainerization DockerContainerization KubernetesCloud platforms AWSCloud platforms GCPCloud platforms AzureProgramming (Python)ML frameworks TensorFlowML frameworks PyTorchML monitoring toolsFocus on qualityCommunication skillsCollaborative approach

Required

5+ years of experience in ML engineering, preferably in a VC-backed startup environment
Hands-on experience deploying models at scale, including familiarity with containerization (Docker, Kubernetes) and cloud platforms (AWS, GCP, or Azure) to build and operate ML systems in production
Experience with Prometheus, Grafana, distributed tracing, or ML-specific monitoring (Weights & Biases, MLflow for production)
Proficiency in programming (especially Python) and experience with modern ML frameworks/libraries such as TensorFlow, PyTorch, etc
Deep understanding of machine learning algorithms and the model development lifecycle (data preprocessing, training, parameter tuning, and evaluation)
Proven track record of delivering software that creates real value for users
Excellent communication skills with an ability to explain complex ML concepts to non-experts, and a collaborative approach to working with cross-functional teams and partners
Fluency in models
Adept at integrating production infrastructure and observability
Lead performance benchmarking
Comfortable working in code and diverse production environments
Strong focus on correctness and quality

Preferred

Experience in a VC-backed startup environment
Familiarity with containerization and cloud platforms
Experience with ML-specific monitoring tools

Benefits

Comprehensive Health, Dental, and Vision coverage beginning on the first day for employees and their families, paid 100% by Sailplane.
Equity grant participation.
Flexible PTO with no accrual or set annual cap, plus 15 paid holidays per year.
Health and Wellness stipend ($3,000 annually) to help support your personal health goals.
AI tools stipend ($1,200 annually) to encourage hands-on familiarity with emerging tools.
12 weeks of paid parental leave.

Company

Sailplane

twittertwitter
company-logo
Sailplane is a technological company focused on Hierarchical Planning AI research.

Funding

Current Stage
Early Stage
Total Funding
unknown
2023-10-01Seed
Company data provided by crunchbase