Research Engineer / Scientist – Reinforcement Learning (RL) jobs in United States

Percepta · 3 months ago

Research Engineer / Scientist – Reinforcement Learning (RL)

Percepta is a company focused on transforming critical industries through applied AI. The Research Engineer/Scientist will work at the intersection of reinforcement learning research and real-world deployment, collaborating with product managers and engineers to drive AI transformation in sectors like healthcare and energy.

Computer Software
H1B Sponsor Likely

Responsibilities

Identify which real-world challenges are tractable for RL-guided decision making
Develop RL methods to perform complex tasks in domains like planning, decision-making, or optimization
Develop and maintain the experimental infrastructure that powers our research, from simulation environments and data pipelines to training and evaluation frameworks
Conduct in-the-wild evaluations at scale that drive millions of dollars in value
Partner with our applied AI engineers to transition successful research ideas into robust features of our Mosaic platform
Communicate research outcomes to both technical and non-technical stakeholders, making sure everyone understands the 'so what' of research and how to apply it

Qualifications

Reinforcement Learning, Python, Distributed systems, Large scale LLM training, Kubernetes, Asynchronous training, Effective communication, Extreme ownership

Required

Have an MS/PhD in Computer Science, ML, or related field, or equivalent experience
Have a track record of effective RL work
Are motivated by impact in critical industries including healthcare, supply chains, energy, and finance
Understand how to perform rigorous RL experimentation
Enjoy extreme ownership
Believe that AI can drive transformative change in critical industries
Have experience with high-performance, large-scale distributed systems
Have experience with large-scale LLM training or RL training
Possess strong programming skills, especially in Python
Have experience implementing LLM post-training algorithms
Experience with vLLM/SGLang, Ray, Kubernetes (or AWS EKS)
Experience with distributed checkpointing, multi-node and multi-GPU training, and custom KV-caching
Experience with asynchronous training and inference, using frameworks such as VeRL, ROLL, SkyRL, or AReaL, or RL libraries like CleanRL

Company

Percepta

Percepta (a GC Transformation Company) combines applied AI engineering with frontier research to transform enterprises.

H1B Sponsorship

Percepta has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship (chart not shown; the highlighted segment represents job fields similar to this job)
Trends of Total Sponsorships: 2024 (2)

Funding

Current Stage
Early Stage
Company data provided by Crunchbase