Zyphra · 2 months ago
Research Scientist - Agency and Reasoning
Zyphra is an artificial intelligence company based in Palo Alto, California. The Research Scientist will contribute to the Agency and Reasoning Team by performing novel research in reinforcement learning and applying ideas to the next generation of language models.
Artificial Intelligence (AI)Cloud ComputingMachine LearningSoftware
Responsibilities
Perform novel research in reinforcement learning, post-training, and human preference learning
Apply ideas at scale to next generation of language models
Qualification
Required
Experience and aptitude with reinforcement learning, either in the context of language model reasoning or more classical RL tasks
Experience with language model supervised finetuning and preference learning methods such as DPO, simPO, etc
Experience with context-length extension methods
A good intuitive ability to understand model behaviors and correct them through iterative fine-tuning
Interest in grappling in detail with data and spending significant time involved in data engineering and synthetic data generation
Postgraduate degree in a scientific subject (Computer Science, EE/EECS, Mathematics, Physics)
Previously published machine learning research in well-respected venues
Highly proficient with PyTorch and Python
We are excited and able to rapidly learn new fields and implement new ideas
Excellent communication and collaboration skills, and can work effectively on both research and engineering implementation at scale
Preferred
Strong research taste and intuition
The ability to work through a research project from conception to execution to write-up
Strong implementation and prototyping skillset
A researcher who can take an idea from conception to experimentation extremely quickly
The ability to work well and cooperate with others in a high-paced research setting
Curiosity, interest, and joy in understanding intelligence
Benefits
Comprehensive medical, dental, vision, and FSA plans
Competitive compensation and 401(k)
Relocation and immigration support on a case-by-case basis
On-site meals prepared by a dedicated culinary team; Thursday Happy Hours
In-person team in Palo Alto, CA, with a collaborative, high-energy environment
Company
Zyphra
Zyphra is superintelligence research and product company based in San Francisco, California.
H1B Sponsorship
Zyphra has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)
Funding
Current Stage
Growth StageTotal Funding
$100M2025-06-09Series A· $100M
2023-06-09Seed
2021-11-18Pre Seed
Recent News
2025-11-30
2025-11-27
Company data provided by crunchbase