Staff Software Engineer, Cluster Orchestration jobs in United States

CoreWeave · 2 weeks ago

Staff Software Engineer, Cluster Orchestration

CoreWeave is The Essential Cloud for AI™, delivering a platform that enables innovators to build and scale AI with confidence. The Staff Software Engineer will play a key role in advancing the orchestration platform, ensuring workloads run seamlessly across massive GPU clusters and empowering customers to innovate faster.

AI Infrastructure · Artificial Intelligence (AI) · Cloud Computing · Cloud Infrastructure · Information Technology · Machine Learning
No H1B · U.S. Citizen Only

Responsibilities

Define architectural direction for CoreWeave’s orchestration platform
Own critical parts of the orchestration platform and other managed services
Drive cross-org initiatives in scheduling, quota enforcement, and scaling at hyperscale
Mentor senior engineers and establish org-wide best practices in reliability and observability
Ensure CoreWeave’s orchestration layer evolves to meet the demands of next-generation AI workloads

Qualifications

Slurm/Kubernetes expertise · Distributed systems design · Go programming · Technical direction setting · Cloud-native development · Orchestration technologies · Distributed workloads experience · Reliability practices knowledge · Mentorship · Problem-solving

Required

8+ years of software engineering experience
Proven track record designing and operating large-scale distributed systems in production
Deep expertise in Slurm/Kubernetes internals and cloud-native development
Advanced proficiency in Go, distributed systems design, and cloud-native development
Experience setting technical direction and influencing cross-team architecture

Preferred

Familiarity with orchestration and workflow technologies such as Ray, Kubeflow, Kueue, Istio, Knative, or Argo Workflows
Deep expertise in Slurm/Kubernetes internals
Experience with distributed workloads, GPU-based applications, or ML pipelines
Knowledge of scheduling concepts like quota enforcement, pre-emption, and scaling strategies
Exposure to reliability practices including SLOs, alarms, and post-incident reviews
Experience with AI infrastructure and workloads (ML training, inference, or HPC)
Ability to mentor senior engineers and elevate organizational standards

Benefits

Medical, dental, and vision insurance - 100% paid for by CoreWeave
Company-paid Life Insurance
Voluntary supplemental life insurance
Short- and long-term disability insurance
Flexible Spending Account
Health Savings Account
Tuition Reimbursement
Ability to participate in the Employee Stock Purchase Program (ESPP)
Mental Wellness Benefits through Spring Health
Family-Forming support provided by Carrot
Paid Parental Leave
Flexible, full-service childcare support with Kinside
401(k) with a generous employer match
Flexible PTO
Catered lunch each day in our office and data center locations
A casual work environment
A work culture focused on innovative disruption

Company

CoreWeave

CoreWeave is a cloud-based AI infrastructure company offering GPU cloud services to simplify AI and machine learning workloads.

Funding

Current Stage: Public Company
Total Funding: $23.37B
Key Investors: Jane Street Capital, Stack Capital, Coatue
2025-12-08 · Post-IPO Debt · $2.54B
2025-11-12 · Post-IPO Debt · $1B
2025-08-20 · Post-IPO Secondary

Leadership Team

Michael Intrator, Chief Executive Officer
Nitin Agrawal, Chief Financial Officer
Company data provided by Crunchbase