DeepRec.ai · 2 weeks ago
Senior LLM Research Scientist
DeepRec.ai is a frontier-stage research group focused on creating intelligent agents that can reason, plan, and act across the physical world. The Senior LLM Research Scientist will develop advanced models and strategies for agent architectures, working closely with engineering and data teams to integrate these models into real-world applications.
Responsibilities
Develop advanced models and prompting systems for planning, multi-step reasoning, and structured tool use
Lead training initiatives across SFT, RLHF/DPO, verifier-guided RL, and modular expert architectures to strengthen robustness and controllability
Define schemas, tool-calling strategies, policy constraints, safety mechanisms, and recovery pathways for agent behavior
Partner closely with engineering, simulation, and data teams to test, train, and evaluate models embedded in realistic, production-like toolchains
Qualifications
Required
Significant experience in LLM research, agent reasoning models, or structured tool-use frameworks
Strong background working with SFT, RLHF, DPO, or reinforcement-learning-from-verification methods
Demonstrated ability to design, analyze, and improve long-horizon behaviors and decomposition strategies
Comfort working across ML research, systems engineering, and real-world experimentation in a fast-moving environment
A track record of excellence and ownership in technically demanding domains