Platform engineer, MLOps @ Writer | Jobright.ai
JOBSarrow
RecommendedLiked
0
Applied
0
External
0
Platform engineer, MLOps jobs in San Francisco, CA
Be an early applicantLess than 25 applicants
company-logo

Writer · 1 day ago

Platform engineer, MLOps

ftfMaximize your interview chances
ContentGenerative AI
check
Growth Opportunities
check
H1B Sponsor Likelynote

Insider Connection @Writer

Discover valuable connections within the company who might provide insights and potential referrals.
Get 3x more responses when you reach out via email instead of LinkedIn.

Responsibilities

Work closely with AI/ML engineers and researchers to design and deploy a CI/CD pipeline that ensures safe and reproducible experiments.
Set up and manage monitoring, logging, and alerting systems for extensive training runs and client-facing APIs.
Ensure training environments are consistently available and prepared across multiple clusters.
Develop and manage containerization and orchestration systems utilizing tools such as Docker and Kubernetes.
Operate and oversee large Kubernetes clusters with GPU workloads.
Improve reliability, quality, and time-to-market of our suite of software solutions
Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating for continual improvement
Provide primary operational support and engineering for multiple large-scale distributed software applications

Qualification

Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.

Model trainingHuggingface TransformersPytorchKubernetesVLLMTensorRTInfrastructure as codeTerraformPythonGoogle CloudAWSAzureGitLarge-scale ML systemsOperational supportBashGitHub workflowsTracingMonitoringPrometheusGrafanaScalable systems

Required

Professional experience with model training
Huggingface Transformers
Pytorch
vLLM
TensorRT
Infrastructure as code tools like Terraform
Scripting languages such as Python or Bash
Cloud platforms such as Google Cloud, AWS or Azure
Git and GitHub workflows
Tracing and Monitoring
Familiar with high-performance, large-scale ML systems
A knack for troubleshooting complex systems and enjoy solving challenging problems
Proactive in identifying problems, performance bottlenecks, and areas for improvement
Take pride in building and operating scalable, reliable, secure systems
Comfortable with ambiguity and rapid change

Preferred

Familiar with monitoring tools such as Prometheus, Grafana, or similar
5+ years building core infrastructure
Experience running inference clusters at scale
Experience operating orchestration systems such as Kubernetes at scale

Benefits

Generous PTO, plus company holidays
Medical, dental, and vision coverage for you and your family
Paid parental leave for all parents (12 weeks)
Fertility and family planning support
Early-detection cancer testing through Galleri
Flexible spending account and dependent FSA options
Health savings account for eligible plans with company contribution
Annual work-life stipends for: Home office setup, cell phone, internet, Wellness stipend for gym, massage/chiropractor, personal training, etc., Learning and development stipend
Company-wide off-sites and team off-sites
Competitive compensation, company stock options and 401k

Company

Writer

twittertwittertwitter
company-logo
Writer is a software firm that develops a full-stack generative AI platform delivering transformative ROI for enterprises.

H1B Sponsorship

Writer has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2022 (1)

Funding

Current Stage
Growth Stage
Total Funding
$326M
Key Investors
ICONIQ GrowthInsight PartnersUpfront Ventures
2024-11-12Series C· $200M
2023-09-18Series B· $100M
2021-11-15Series A· $21M

Leadership Team

leader-logo
May Habib
Co-founder and CEO
linkedin
leader-logo
Waseem AlShikh
Co-founder and CTO
linkedin
Company data provided by crunchbase
logo

Orion

Your AI Copilot