Reinforcement Learning Engineer- Weights & Biases jobs in United States
cer-icon
Apply on Employer Site
company-logo

Weights & Biases · 2 weeks ago

Reinforcement Learning Engineer- Weights & Biases

Weights & Biases, acquired by CoreWeave, aims to create a powerful end-to-end platform for AI development. The role focuses on applied research to solve challenges in continuous learning for AI agents, leveraging extensive GPU resources to drive innovation.

AI InfrastructureArtificial Intelligence (AI)Data VisualizationDeveloper ToolsGenerative AIMachine Learning
check
Comp. & Benefits
badNo H1BnoteU.S. Citizen Onlynote

Responsibilities

Generate and investigate research ideas towards solving the remaining obstacles to continuous learning in production
Work with the broader OpenPipe team to validate these research directions across real customer tasks

Qualification

Reinforcement LearningLarge Language ModelsCUDA programmingKubernetesFastAPIProblem-solvingTeam collaborationAdaptability

Required

You have trained LLMs to be SOTA on specific tasks
You have opinions on whether sequence-level or token-level importance ratios are more effective
You probably shared the ScaleRL paper in your group chats, and kicked off a few ablations after you read it
This is an applied research role
You will be expected to generate and investigate research ideas towards solving the remaining obstacles to continuous learning in production
You will work with the broader OpenPipe team to validate these research directions across real customer tasks
We are very GPU rich and are ready to direct an enormous amount of compute at this effort
The most important qualification by far is that you learn fast and can ship
This role will inevitably involve a lot of learning on the job
Engineers on our team touch everything from CUDA kernels to high-performance LLM tracing dashboards
You should be great at what you do—we'll look for impressive, impactful accomplishments from past projects or roles
Although we operate as part of a larger company, the OpenPipe team is small, has a large degree of autonomy and drives our own roadmap and priorities

Benefits

Medical, dental, and vision insurance - 100% paid for by CoreWeave
Company-paid Life Insurance
Voluntary supplemental life insurance
Short and long-term disability insurance
Flexible Spending Account
Health Savings Account
Tuition Reimbursement
Ability to Participate in Employee Stock Purchase Program (ESPP)
Mental Wellness Benefits through Spring Health
Family-Forming support provided by Carrot
Paid Parental Leave
Flexible, full-service childcare support with Kinside
401(k) with a generous employer match
Flexible PTO
Catered lunch each day in our office and data center locations
A casual work environment
A work culture focused on innovative disruption

Company

Weights & Biases

company-logo
Weights & Biases is a developer-first MLOps platform that builds machine learning performance visualization tools.

Funding

Current Stage
Growth Stage
Total Funding
$250M
Key Investors
NVIDIAInsight PartnersCoatue
2025-03-04Acquired
2023-09-01Secondary Market
2023-08-09Series Unknown· $50M

Leadership Team

leader-logo
Chris Van Pelt
Co-Founder & CISO
linkedin
leader-logo
Shawn Lewis
Founder/CTO
linkedin
Company data provided by crunchbase