Research Engineer, Frontier Speculative Decoding jobs in United States
cer-icon
Apply on Employer Site
company-logo

Together AI · 2 weeks ago

Research Engineer, Frontier Speculative Decoding

Together AI is a research-driven artificial intelligence company focused on building an Inference Platform for advanced generative AI models. The Research Engineer role involves bridging research and real-world applications, fine-tuning models, and collaborating with customers to develop specialized tools that address business challenges.

AI InfrastructureArtificial Intelligence (AI)Generative AIInternetIT InfrastructureOpen Source
check
H1B Sponsor Likelynote

Responsibilities

Design and iterate on novel speculator algorithms, combining architectural innovations with carefully curated data to push the frontier of accuracy–efficiency tradeoffs
Be the critical link between raw data and a production-ready model, seeing your work directly impact our customers' success
Work in a fast-paced, high-impact role at the cutting edge of generative AI
Collaborate with a team of experts dedicated to solving real-world, high-performance challenges
You'll collaborate directly with customers to understand their needs, and work closely with our core inference and Applied ML research teams to integrate your work into the production platform
A culture of deep technical ownership where you are empowered to take on and solve challenging problems

Qualification

Hyperparameter tuningPythonPyTorchData curationKubernetesSLURMDistributed trainingGenerative modelsComplex code navigationAttention to detail

Required

A genuine love for data curation and processing, with a meticulous attention to detail. You believe that great models start with great data
Demonstrated ability to perform effective hyperparameter searches and understand the trade-offs involved in tuning models for specific tasks
Experience working with and building on top of existing training codebases. You are comfortable navigating complex code and contributing to its improvement
Strong attention-to-detail in evaluating model checkpoints to ensure they meet strict quality, performance, and reliability standards
Experience with Python and PyTorch
Familiarity with SLURM and/or Kubernetes clusters and experience submitting and managing jobs in a high-performance computing environment
Familiarity with modern LLMs and generative models
Basic understanding of distributed training frameworks (e.g., FSDP, DeepSpeed)
Bachelor's, Master's degree, or Ph.D. in Computer Science, Computer Engineering, or a related field, or equivalent practical experience

Benefits

Startup equity
Health insurance
Other competitive benefits

Company

Together AI

twittertwittertwitter
company-logo
Together AI is a cloud-based platform designed for constructing open-source generative AI and infrastructure for developing AI models.

H1B Sponsorship

Together AI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (19)
2024 (6)
2023 (3)

Funding

Current Stage
Growth Stage
Total Funding
$533.5M
Key Investors
Salesforce VenturesLux Capital
2025-02-20Series B· $305M
2024-03-13Series A· $106M
2023-11-29Series A· $102.5M

Leadership Team

leader-logo
Vipul Ved Prakash
Co-Founder & CEO
linkedin
leader-logo
Kae Ike Lim
Executive Assistant to Co-Founder and CEO
linkedin
Company data provided by crunchbase