Sr. Software Engineer - Perf and Benchmarking jobs in United States
cer-icon
Apply on Employer Site
company-logo

CoreWeave · 1 week ago

Sr. Software Engineer - Perf and Benchmarking

CoreWeave is The Essential Cloud for AI™, providing technology and tools for innovators to build and scale AI. The Senior Engineer will play a crucial role in the Benchmarking & Performance team, focusing on performance data warehousing and achieving industry-leading performance benchmarking publications.

AI InfrastructureArtificial Intelligence (AI)Cloud ComputingCloud InfrastructureInformation TechnologyMachine Learning
badNo H1BnoteU.S. Citizen Onlynote

Responsibilities

Build and improve Kubernetes-native benchmarking services that measure latency, throughput, jitter, and cost-per-request across CoreWeave’s compute stack
Implement and maintain benchmarking workflows for end-to-end MLPerf Training and Inference runs, including workload setup, cluster configuration, runbooks, and result validation
Lead design reviews and drive architecture within the team; decompose multi-service work into clear milestones
Mentor junior engineers; review cross-team designs and elevate coding/testing standards
Help ensure reproducible, well-documented benchmarking processes

Qualification

KubernetesPythonPerformance benchmarkingDistributed systemsHigh-performance computingCI/CDObservability stacksGoC++CommunicatorMentoringCollaboration

Required

5+ years of experience building distributed systems, high-performance computing, or cloud services
Strong coding in Python or Go (C++ a plus) and deep familiarity with networked systems and performance
Hands-on experience with Kubernetes at production scale, CI/CD, and observability stacks (Prometheus, Grafana, OpenTelemetry)
Experience with performance-critical GPU systems (CUDA, NCCL, RDMA, NVLink/PCIe, memory bandwidth) and model-serving stacks (llm-d, vLLM, TensorRT-LLM, Megatron-LM)
Strong communicator comfortable collaborating with cross-functional teams and external partners

Preferred

Experience with time-series databases, LSM-based storage engines, or custom data pipelines
Experience running MLPerf submissions or similar large-scale audited benchmarks
Contributions to OSS projects such as llm-d, vLLM or PyTorch
Exposure to benchmarking large GPU fleets or multi-region clusters
Experience with CUDA kernels, NCCL/SHARP, RDMA/NUMA, or GPU interconnect topologies

Benefits

Medical, dental, and vision insurance - 100% paid for by CoreWeave
Company-paid Life Insurance
Voluntary supplemental life insurance
Short and long-term disability insurance
Flexible Spending Account
Health Savings Account
Tuition Reimbursement
Ability to Participate in Employee Stock Purchase Program (ESPP)
Mental Wellness Benefits through Spring Health
Family-Forming support provided by Carrot
Paid Parental Leave
Flexible, full-service childcare support with Kinside
401(k) with a generous employer match
Flexible PTO
Catered lunch each day in our office and data center locations
A casual work environment
A work culture focused on innovative disruption

Company

CoreWeave

twittertwittertwitter
company-logo
CoreWeave is a cloud-based AI infrastructure company offering GPU cloud services to simplify AI and machine learning workloads.

Funding

Current Stage
Public Company
Total Funding
$23.37B
Key Investors
Jane Street CapitalStack CapitalCoatue
2025-12-08Post Ipo Debt· $2.54B
2025-11-12Post Ipo Debt· $1B
2025-08-20Post Ipo Secondary

Leadership Team

leader-logo
Michael Intrator
Chief Executive Officer
linkedin
leader-logo
Nitin Agrawal
Chief Financial Officer
linkedin
Company data provided by crunchbase