CoreWeave · 14 hours ago
Staff Software Engineer, Cluster Orchestration
CoreWeave is The Essential Cloud for AI™, delivering a platform of technology that enables innovators to build and scale AI. As a Staff Engineer, you will lead the long-term strategy for the orchestration platform, focusing on architectural direction and mentoring engineers to enhance reliability and observability.
AI InfrastructureArtificial Intelligence (AI)Cloud ComputingCloud InfrastructureInformation TechnologyMachine Learning
Responsibilities
You will play a key role in advancing CoreWeave’s orchestration platform including SUNK (Slurm on Kubernetes) and beyond, our Kubernetes-native foundation that powers AI training and inference at scale
As a Staff Engineer, you will be a technical leader shaping the long-term strategy for CoreWeave’s orchestration platform
You’ll define architectural direction, own critical parts of the orchestration platform and other managed services, and drive cross-org initiatives in scheduling, quota enforcement, and scaling at hyperscale
You’ll mentor senior engineers, establish org-wide best practices in reliability and observability, and ensure CoreWeave’s orchestration layer evolves to meet the demands of next-generation AI workloads
Qualification
Required
8+ years of software engineering experience
Proven track record designing and operating large-scale distributed systems in production
Deep expertise in Slurm/Kubernetes internals and cloud-native development
Advanced proficiency in Go and distributed systems design and cloud-native development
Experience setting technical direction and influencing cross-team architecture
Bachelor's or Master's degree in CS, EE, or related field
Preferred
Familiarity with orchestration and workflow technologies such as Ray, Kubeflow, Kueue, Istio, Knative, or Argo Workflows
Deep expertise in Slurm/Kubernetes internals
Experience with distributed workloads, GPU-based applications, or ML pipelines
Knowledge of scheduling concepts like quota enforcement, pre-emption, and scaling strategies
Exposure to reliability practices including SLOs, alarms, and post-incident reviews
Experience with AI infrastructure and workloads (ML training, inference, or HPC)
Ability to mentor senior engineers and elevate organizational standards
Benefits
Medical, dental, and vision insurance - 100% paid for by CoreWeave
Company-paid Life Insurance
Voluntary supplemental life insurance
Short and long-term disability insurance
Flexible Spending Account
Health Savings Account
Tuition Reimbursement
Ability to Participate in Employee Stock Purchase Program (ESPP)
Mental Wellness Benefits through Spring Health
Family-Forming support provided by Carrot
Paid Parental Leave
Flexible, full-service childcare support with Kinside
401(k) with a generous employer match
Flexible PTO
Catered lunch each day in our office and data center locations
A casual work environment
A work culture focused on innovative disruption
Company
CoreWeave
CoreWeave is a cloud-based AI infrastructure company offering GPU cloud services to simplify AI and machine learning workloads.
Funding
Current Stage
Public CompanyTotal Funding
$23.37BKey Investors
Jane Street CapitalStack CapitalCoatue
2025-12-08Post Ipo Debt· $2.54B
2025-11-12Post Ipo Debt· $1B
2025-08-20Post Ipo Secondary
Recent News
The Motley Fool
2026-01-17
2026-01-17
Company data provided by crunchbase