Zyphra · 2 months ago
Research Engineer, Language Model Pre-Training
Zyphra is an artificial intelligence company based in Palo Alto, California. As a Research Engineer specializing in Language Model Pre-Training, you will shape the language model roadmap through end-to-end pretraining development, working closely with the pretraining team to integrate insights into next-generation models.
Artificial Intelligence (AI), Cloud Computing, Machine Learning, Software
Responsibilities
Large-scale training runs and model parallelization
Performance optimization of our pretraining stack
Dataset collection, processing, and evaluation
Architecture and methodology research, including optimizer ablations
Qualifications
Required
Strong engineering aptitude for rapidly implementing reliable and robust systems
Ability to rapidly learn new fields and excitement about implementing new ideas
Excellent communication and collaboration skills, with the ability to work effectively on both research and engineering implementation at scale
Preferred
Deep expertise and intuition for solving machine learning problems and training models
Experience with training on large-scale (multi-node) GPU clusters
Deep understanding of model training pipelines, including model/data parallelism, distributed optimizers, etc.
Strong grasp of proper experimental methodology for running rigorous ablations and other hypothesis testing
Understanding of large-scale, highly parallel data processing pipelines
High proficiency with PyTorch and Python
Strong ability to dive into large pre-existing codebases and rapidly get up to speed
Published machine learning research in well-respected venues is a plus
Postgraduate degree in a scientific subject (Computer Science, EE/EECS, Math, Physics)
Benefits
Comprehensive medical, dental, vision, and FSA plans
Competitive compensation and 401(k)
Relocation and immigration support on a case-by-case basis
On-site meals prepared by a dedicated culinary team; Thursday Happy Hours
In-person team in Palo Alto, CA, with a collaborative, high-energy environment
Company
Zyphra
Zyphra is a superintelligence research and product company based in San Francisco, California.
H1B Sponsorship
Zyphra has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for your reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships: 2025 (1)
Funding
Current Stage: Growth Stage
Total Funding: $100M
2025-06-09: Series A · $100M
2023-06-09: Seed
2021-11-18: Pre Seed
Company data provided by Crunchbase