Writer · 15 hours ago
Platform engineer, MLOps
Maximize your interview chances
ContentGenerative AI
Growth OpportunitiesH1B Sponsor Likely
Insider Connection @Writer
Get 3x more responses when you reach out via email instead of LinkedIn.
Responsibilities
Work closely with AI/ML engineers and researchers to design and deploy a CI/CD pipeline that ensures safe and reproducible experiments.
Set up and manage monitoring, logging, and alerting systems for extensive training runs and client-facing APIs.
Ensure training environments are consistently available and prepared across multiple clusters.
Develop and manage containerization and orchestration systems utilizing tools such as Docker and Kubernetes.
Operate and oversee large Kubernetes clusters with GPU workloads.
Improve reliability, quality, and time-to-market of our suite of software solutions
Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating for continual improvement
Provide primary operational support and engineering for multiple large-scale distributed software applications
Qualification
Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.
Required
Professional experience with model training
Huggingface Transformers
Pytorch
vLLM
TensorRT
Infrastructure as code tools like Terraform
Scripting languages such as Python or Bash
Cloud platforms such as Google Cloud, AWS or Azure
Git and GitHub workflows
Tracing and Monitoring
Familiar with high-performance, large-scale ML systems
A knack for troubleshooting complex systems and enjoy solving challenging problems
Proactive in identifying problems, performance bottlenecks, and areas for improvement
Take pride in building and operating scalable, reliable, secure systems
Comfortable with ambiguity and rapid change
Preferred
Familiar with monitoring tools such as Prometheus, Grafana, or similar
5+ years building core infrastructure
Experience running inference clusters at scale
Experience operating orchestration systems such as Kubernetes at scale
Benefits
Generous PTO, plus company holidays
Medical, dental, and vision coverage for you and your family
Paid parental leave for all parents (12 weeks)
Fertility and family planning support
Early-detection cancer testing through Galleri
Flexible spending account and dependent FSA options
Health savings account for eligible plans with company contribution
Annual work-life stipends for: Home office setup, cell phone, internet, Wellness stipend for gym, massage/chiropractor, personal training, etc., Learning and development stipend
Company-wide off-sites and team off-sites
Competitive compensation, company stock options and 401k
Company
Writer
Writer is a software firm that develops a full-stack generative AI platform delivering transformative ROI for enterprises.
H1B Sponsorship
Writer has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2022 (1)
Funding
Current Stage
Growth StageTotal Funding
$326MKey Investors
ICONIQ GrowthInsight PartnersUpfront Ventures
2024-11-12Series C· $200M
2023-09-18Series B· $100M
2021-11-15Series A· $21M
Recent News
2024-12-23
Crunchbase News
2024-12-19
Company data provided by crunchbase