HPC Performance Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

CoreWeave · 2 days ago

HPC Performance Engineer

CoreWeave is a leading provider of cloud solutions specifically designed for AI applications. They are seeking a highly skilled HPC Performance Engineer to design, develop, and optimize bare-metal systems, ensuring high performance in the context of hardware updates and collaborating closely with cross-functional teams.

Artificial Intelligence (AI)Cloud ComputingCloud InfrastructureInformation TechnologyMachine Learning
badNo H1BnoteU.S. Citizen Onlynote

Responsibilities

Develop and maintain tools for establishing systems performance baselines
Develop and maintain performance regression analysis testing automation
Design and maintain performance regression test pipelines for HPC workloads
Debug and Tune fabric-level performance to ensure low-latency high throughput configurations
Development of telemetry for performance analysis across distributed clusters of servers
Triage and fix performance issues in Linux
Collect data, produce metrics and visualizations that communicate performance information compared to benchmarks; this data should lead to appropriate business decisions and toward greater automation that improves customer experience in relation to performance
Define Linux and OS requirements, specifications, and system architecture in relation to systems performance, in collaboration with cross-functional teams. Along with these responsibilities there will also be cross team collaboration to triage and resolve bottlenecks

Qualification

HPC Performance EngineeringLinux InternalsPerformance AnalysisAutomation TestingPythonKubernetesDockerMPI WorkloadsGoAnalytical SkillsDocumentation SkillsProblem-Solving Skills

Required

5+ years of professional experience in Systems/HPC Performance Engineering, Benchmarking, and/or Validation
Strong experience with MPI workloads and distributed system performance analysis
Familiarity with RoCE, InfiniBand, and GPUDirect/Data Direct I/O, NUMA, etc in HPC workloads
Hands-on use of public HPC benchmarks (HPCC, HPL, OSU, MLPerf-HPC, STREAM, IO500)
Extensive, deep experience in Linux internals
Fluency with a programming language geared toward automation (Python preferred, but others possible)
Experience writing robust, testable code
Experience diagnosing and fixing systems performance issues
Experiencing with implementing automation testing
Ability to effectively prioritize and communicate proposed features and fixes in a remote-employee environment
Strong passion for automation, with a commitment to automating processes comprehensively
Excellent documentation skills and attention to detail
Strong analytical and problem-solving abilities

Preferred

Familiarity with QA/QE best practices
Familiarity with Golang
Opinions about software version control and team collaboration
Experience working in Cloud environments
Experience as a software engineer writing large-scale applications
Experience in open-source community software development
Experience with machine learning is a huge bonus

Benefits

Medical, dental, and vision insurance - 100% paid for by CoreWeave
Company-paid Life Insurance
Voluntary supplemental life insurance
Short and long-term disability insurance
Flexible Spending Account
Health Savings Account
Tuition Reimbursement
Ability to Participate in Employee Stock Purchase Program (ESPP)
Mental Wellness Benefits through Spring Health
Family-Forming support provided by Carrot
Paid Parental Leave
Flexible, full-service childcare support with Kinside
401(k) with a generous employer match
Flexible PTO
Catered lunch each day in our office and data center locations
A casual work environment
A work culture focused on innovative disruption

Company

CoreWeave

twittertwittertwitter
company-logo
CoreWeave is a cloud-based AI infrastructure company offering GPU cloud services to simplify AI and machine learning workloads.

Funding

Current Stage
Public Company
Total Funding
$23.37B
Key Investors
Jane Street CapitalStack CapitalCoatue
2025-12-08Post Ipo Debt· $2.54B
2025-11-12Post Ipo Debt· $1B
2025-08-20Post Ipo Secondary

Leadership Team

leader-logo
Michael Intrator
Chief Executive Officer
linkedin
leader-logo
Nitin Agrawal
Chief Financial Officer
linkedin
Company data provided by crunchbase