Apply on Employer Site

Prime Intellect · 3 months ago

Applied Research - RL & Agents

San Francisco

Full-time

Onsite

Mid Level

Prime Intellect is building the open superintelligence stack, enabling researchers and enterprises to run end-to-end reinforcement learning at scale. The role involves designing advanced AI agents, developing robust infrastructure, and bridging customer needs with research priorities.

Agentic AIArtificial Intelligence (AI)Cloud ComputingMachine Learning

H1B Sponsored

Responsibilities

Advancing Agent Capabilities: Designing and iterating on next-generation AI agents that tackle real workloads—workflow automation, reasoning-intensive tasks, and decision-making at scale

Building Robust Infrastructure: Developing the distributed systems and coordination frameworks that enable these agents to operate reliably, efficiently, and at massive scale

Bridge Between Customers & Research: Translate customer needs into clear technical requirements that guide product and research priorities

Prototype in the Field: Rapidly design and deploy agents, evals, and harnesses alongside customers to validate solutions

Customer-Facing Engineering: Work side-by-side with customers to deeply understand workflows and bottlenecks

Prototype agents and eval harnesses tailored to real use cases, then hand off hardened systems to core teams

Translate customer insights into roadmap and research direction

Post-training & Reinforcement Learning: Design and implement novel RL and post-training methods (RLHF, RLVR, GRPO, etc.) to align large models with domain-specific tasks

Build evaluation harnesses and verifiers to measure reasoning, robustness, and agentic behavior in real-world workflows

Prototype multi-agent and memory-augmented systems to expand capabilities for customer-facing solutions

Agent Development & Infrastructure: Rapidly prototype and iterate on AI agents for automation, workflow orchestration, and decision-making

Extend and integrate with agent frameworks to support evolving feature requests and performance requirements

Architect and maintain distributed training/inference pipelines, ensuring scalability and cost efficiency

Develop observability and monitoring (Prometheus, Grafana, tracing) to ensure reliability and performance in production deployments

Qualification

Machine Learning EngineeringReinforcement LearningDistributed TrainingContainerizationResearch ContributionsCustomer-Facing SkillsWorkflow AutomationMonitoringDecision-MakingObservability

Required

Strong background in machine learning engineering, with experience in post-training, RL, or large-scale model alignment

Deep expertise in distributed training/inference frameworks (e.g., vLLM, sglang, Ray, Accelerate)

Experience deploying containerized systems at scale (Docker, Kubernetes, Terraform)

Track record of research contributions (publications, open-source contributions, benchmarks) in ML/RL

Passion for advancing the state-of-the-art in reasoning and building practical, agentic AI systems

Benefits

Competitive Compensation + equity incentives

Flexible Work (remote or San Francisco)

Visa Sponsorship & relocation support

Professional Development budget

Team Off-sites & conference attendance

Company

Prime Intellect

Prime Intellect is a full-stack platform that offers agentic training infrastructure for organizations to train frontier AI using LLMs.

Founded in 2024

San Francisco, California, USA

11-50 employees

https://www.primeintellect.ai/

Funding

Current Stage

Early Stage

Total Funding

$70.44M

Key Investors

Founders Fund

2026-01-15Series Unknown· $49.94M

2025-02-28Seed· $15M

2024-04-22Seed· $5.5M

Recent News

36kr.com

Using only 512 H200s, the 106B model breaks through with distributed RL and is open-sourced across the network.

2025-12-10

Techmeme

Prime Intellect debuts INTELLECT-3, an RL-trained 106B parameter open source MOE model it claims outperforms larger models across math, code, science, reasoning (Prime Intellect)

2025-11-28

WIRED

This Startup Wants to Spark a US DeepSeek Moment

2025-10-09

Company data provided by crunchbase