Staff Software Engineer - AI Workload Orchestration jobs in United States

CoreWeave · 3 hours ago

Staff Software Engineer - AI Workload Orchestration

CoreWeave is The Essential Cloud for AI™, providing a platform that enables innovators to build and scale AI confidently. They are seeking a Staff Software Engineer to lead the technical vision and architecture for their AI Workload Orchestration Platform, focusing on Kubernetes-native orchestration strategies for AI workloads.

AI Infrastructure, Artificial Intelligence (AI), Cloud Computing, Cloud Infrastructure, Information Technology, Machine Learning
No H1B, U.S. Citizen Only

Responsibilities

Own the technical vision and architecture for major portions of the AI Workload Orchestration Platform
Design scalable, reliable orchestration primitives for AI workloads across multiple schedulers and runtimes
Lead cross-team architecture reviews and drive alignment across infrastructure, CKS, and managed inference teams
Define platform standards for reliability, observability, capacity management, and operational excellence
Identify and resolve systemic performance, scalability, and fairness issues across large GPU clusters
Mentor senior engineers and grow technical leadership within the organization
Represent the platform in technical reviews and influence broader CoreWeave platform strategy

Qualifications

Distributed systems, Cloud platforms, Go programming, Kubernetes internals, Orchestration frameworks, AI infrastructure, Scheduling concepts, Operational mindset, Cross-team influence, Technical leadership

Required

8+ years of professional software engineering experience, with deep expertise in distributed systems or cloud platforms
Strong proficiency in Go and experience designing large-scale, long-lived production systems
Deep knowledge of Kubernetes internals, scheduling mechanisms, and controller-based architectures
Demonstrated experience designing or evolving orchestration, scheduling, or resource-management platforms
Proven ability to lead technical initiatives across teams without direct authority
Strong operational mindset with experience owning mission-critical systems at scale

Preferred

Hands-on experience with Kueue, Volcano, Ray, or similar Kubernetes-native orchestration frameworks
Background in AI infrastructure, ML platforms, HPC, or large-scale batch and streaming systems
Deep understanding of scheduling concepts including fairness, pre-emption, quota management, and multi-tenant isolation
Experience defining and operating SLOs, capacity models, and large-scale reliability improvements
Contributions to open-source infrastructure or orchestration projects

Benefits

Medical, dental, and vision insurance - 100% paid for by CoreWeave
Company-paid Life Insurance
Voluntary supplemental life insurance
Short- and long-term disability insurance
Flexible Spending Account
Health Savings Account
Tuition Reimbursement
Ability to participate in the Employee Stock Purchase Program (ESPP)
Mental Wellness Benefits through Spring Health
Family-Forming support provided by Carrot
Paid Parental Leave
Flexible, full-service childcare support with Kinside
401(k) with a generous employer match
Flexible PTO
Catered lunch each day in our office and data center locations
A casual work environment
A work culture focused on innovative disruption

Company

CoreWeave

CoreWeave is a cloud-based AI infrastructure company offering GPU cloud services to simplify AI and machine learning workloads.

Funding

Current Stage: Public Company
Total Funding: $24.87B
Key Investors: Jane Street Capital, Stack Capital, Coatue
2025-12-08 · Post-IPO Debt · $2.54B
2025-11-12 · Post-IPO Debt · $2.5B
2025-08-20 · Post-IPO Secondary

Leadership Team

Michael Intrator, Chief Executive Officer
Brannin McBee, Founder & CDO
Company data provided by Crunchbase