Research Engineer / Scientist – Reinforcement Learning (RL) jobs in United States
This job has closed.

Percepta · 3 months ago

Research Engineer / Scientist – Reinforcement Learning (RL)

Percepta is dedicated to transforming critical institutions through applied AI, focusing on industries like healthcare and manufacturing. The Research Engineer/Scientist in Reinforcement Learning will advance capabilities in decision-making research and collaborate with product managers and engineers to implement solutions that improve operational efficiency.

Customer Service · Outsourcing
H1B Sponsor Likely

Responsibilities

Identify which real-world challenges are tractable for RL-guided decision making
Develop RL methods to perform complex tasks in domains like planning, decision-making, or optimization
Develop and maintain the experimental infrastructure that powers our research, from simulation environments and data pipelines to training and evaluation frameworks
Conduct in-the-wild evaluations at scale that drive millions of dollars in value
Partner with our applied AI engineers to transition successful research ideas into robust features of our Mosaic platform
Communicate research outcomes to both technical and non-technical stakeholders, making sure everyone understands the “so what” of research and how to apply it

Qualifications

Reinforcement Learning · Python · Distributed systems · Large-scale LLM training · Kubernetes · Asynchronous training · Effective communication · Extreme ownership

Required

Have an MS/PhD in Computer Science, ML, or related field, or equivalent experience
Have a track record of effective RL work
Are motivated by impact in critical industries including healthcare, supply chains, energy, and finance
Understand how to perform rigorous RL experimentation
Enjoy extreme ownership
Believe that AI can drive transformative change in critical industries

Preferred

Experience with high-performance, large-scale distributed systems
Experience with large-scale LLM training or RL training
Strong programming skills, especially in Python
Experience implementing LLM post-training algorithms
Experience with vLLM/SGLang, Ray, Kubernetes (or AWS EKS)
Experience with distributed checkpointing, multi-node and multi-GPU training, and custom KV-caching
Experience with asynchronous training and inference, using either VeRL, ROLL, SkyRL, or AReal, or RL libraries like CleanRL

Company

Percepta

Percepta is a global, contact services company that builds customer loyalty.

H1B Sponsorship

Percepta has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Trends of Total Sponsorships: 2024 (2)

Funding

Current Stage
Late Stage

Leadership Team

Ashlynn Woods
Global Employee Experience Partner
LaSandra Johnson
Human Resources Business Partner II
Company data provided by Crunchbase