This job has closed.

Percepta · 3 months ago

Research Engineer / Scientist – Reinforcement Learning (RL)

Boston, MA

Full-time

Onsite

Mid, Senior Level

Percepta is dedicated to transforming critical institutions through applied AI, focusing on industries like healthcare and manufacturing. The Research Engineer/Scientist in Reinforcement Learning will advance capabilities in decision-making research and collaborate with product managers and engineers to implement solutions that improve operational efficiency.

Customer ServiceOutsourcing

H1B Sponsor Likely

Responsibilities

Identifying which real-world challenges are tractable for RL-guided decision making

Develop RL methods to perform complex tasks in domains like planning, decision-making, or optimization

Develop and maintain the experimental infrastructure that powers our research, from simulation environments and data pipelines to training and evaluation frameworks

Conduct in-the-wild evaluations at scale that drive millions of dollars in value

Partner with our applied AI engineers to transition successful research ideas into robust features of our Mosaic platform

Communicate research outcomes to both technical and non-technical stakeholders, making sure everyone understands the “so what” of research and how to apply it

Qualification

Reinforcement LearningPythonDistributed systemsLarge scale LLM trainingKubernetesAsynchronous trainingEffective communicationExtreme ownership

Required

Have an MS/PhD in Computer Science, ML, or related field, or equivalent experience

Have a track record of effective RL work

Are motivated by impact in critical industries including healthcare, supply chains, energy, and finance

Understand how to perform rigorous RL experimentation

Enjoy extreme ownership

Believe that AI can drive transformative change in critical industries

Identifying which real-world challenges are tractable for RL-guided decision making

Develop RL methods to perform complex tasks in domains like planning, decision-making, or optimization

Develop and maintain the experimental infrastructure that powers our research, from simulation environments and data pipelines to training and evaluation frameworks

Conduct in-the-wild evaluations at scale that drive millions of dollars in value

Partner with our applied AI engineers to transition successful research ideas into robust features of our Mosaic platform

Communicate research outcomes to both technical and non-technical stakeholders, making sure everyone understands the 'so what' of research and how to apply it

Preferred

High performance, large scale distributed systems

Large scale LLM training or RL training

Possess strong programming skills, especially in Python

Implementing LLM post-training algorithms

Experience with vLLM/SGLang, Ray, Kubernetes (or AWS EKS)

Experience with distributed checkpointing, multi-node, multi-gpu training, custom KV-caching

Experience with asynchronous training and inference, either with VeRL, ROLL, SkyRL, AReal, or with RL libraries like CleanRL

Company

Percepta

Glassdoor3.5

Percepta is a global, contact services company that builds customer loyalty.

Founded in 2000

Köln, Nordrhein-Westfalen, DEU

1001-5000 employees

http://www.percepta.com

H1B Sponsorship

Percepta has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)

Distribution of Different Job Fields Receiving Sponsorship

Represents job field similar to this job

Trends of Total Sponsorships

2024 (2)