Research Scientist - Agency and Reasoning jobs in United States
cer-icon
Apply on Employer Site
company-logo

Zyphra · 2 months ago

Research Scientist - Agency and Reasoning

Zyphra is an artificial intelligence company based in Palo Alto, California. The Research Scientist will contribute to the Agency and Reasoning Team by performing novel research in reinforcement learning and applying ideas to the next generation of language models.

Artificial Intelligence (AI)Cloud ComputingMachine LearningSoftware
check
H1B Sponsorednote

Responsibilities

Perform novel research in reinforcement learning, post-training, and human preference learning
Apply ideas at scale to next generation of language models

Qualification

Reinforcement learningLanguage model finetuningData engineeringContext-length extensionPyTorchPythonCommunication skillsCollaboration skills

Required

Experience and aptitude with reinforcement learning, either in the context of language model reasoning or more classical RL tasks
Experience with language model supervised finetuning and preference learning methods such as DPO, simPO, etc
Experience with context-length extension methods
A good intuitive ability to understand model behaviors and correct them through iterative fine-tuning
Interest in grappling in detail with data and spending significant time involved in data engineering and synthetic data generation
Postgraduate degree in a scientific subject (Computer Science, EE/EECS, Mathematics, Physics)
Previously published machine learning research in well-respected venues
Highly proficient with PyTorch and Python
We are excited and able to rapidly learn new fields and implement new ideas
Excellent communication and collaboration skills, and can work effectively on both research and engineering implementation at scale

Preferred

Strong research taste and intuition
The ability to work through a research project from conception to execution to write-up
Strong implementation and prototyping skillset
A researcher who can take an idea from conception to experimentation extremely quickly
The ability to work well and cooperate with others in a high-paced research setting
Curiosity, interest, and joy in understanding intelligence

Benefits

Comprehensive medical, dental, vision, and FSA plans
Competitive compensation and 401(k)
Relocation and immigration support on a case-by-case basis
On-site meals prepared by a dedicated culinary team; Thursday Happy Hours
In-person team in Palo Alto, CA, with a collaborative, high-energy environment

Company

Zyphra

twittertwitter
company-logo
Zyphra is superintelligence research and product company based in San Francisco, California.

H1B Sponsorship

Zyphra has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)

Funding

Current Stage
Growth Stage
Total Funding
$100M
2025-06-09Series A· $100M
2023-06-09Seed
2021-11-18Pre Seed
Company data provided by crunchbase