Together AI · 2 weeks ago
Research Engineer, Frontier Speculative Decoding
Together AI is a research-driven artificial intelligence company focused on building an Inference Platform for advanced generative AI models. The Research Engineer role involves bridging research and real-world applications, fine-tuning models, and collaborating with customers to develop specialized tools that address business challenges.
AI Infrastructure · Artificial Intelligence (AI) · Generative AI · Internet · IT Infrastructure · Open Source
Responsibilities
Design and iterate on novel speculator algorithms, combining architectural innovations with carefully curated data to push the frontier of accuracy–efficiency tradeoffs
Be the critical link between raw data and a production-ready model, seeing your work directly impact our customers' success
Work in a fast-paced, high-impact role at the cutting edge of generative AI
Collaborate with a team of experts dedicated to solving real-world, high-performance challenges
Collaborate directly with customers to understand their needs, and work closely with our core inference and Applied ML research teams to integrate your work into the production platform
A culture of deep technical ownership where you are empowered to take on and solve challenging problems
Qualifications
Required
A genuine love for data curation and processing, with meticulous attention to detail. You believe that great models start with great data
Demonstrated ability to perform effective hyperparameter searches and understand the trade-offs involved in tuning models for specific tasks
Experience working with and building on top of existing training codebases. You are comfortable navigating complex code and contributing to its improvement
Strong attention to detail in evaluating model checkpoints to ensure they meet strict quality, performance, and reliability standards
Experience with Python and PyTorch
Familiarity with SLURM and/or Kubernetes clusters and experience submitting and managing jobs in a high-performance computing environment
Familiarity with modern LLMs and generative models
Basic understanding of distributed training frameworks (e.g., FSDP, DeepSpeed)
Bachelor's, Master's, or Ph.D. degree in Computer Science, Computer Engineering, or a related field, or equivalent practical experience
Benefits
Startup equity
Health insurance
Other competitive benefits
Company
Together AI
Together AI is a cloud-based platform for building open-source generative AI and the infrastructure for developing AI models.
H1B Sponsorship
Together AI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for your reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2025 (19)
2024 (6)
2023 (3)
Funding
Current Stage: Growth Stage
Total Funding: $533.5M
Key Investors: Salesforce Ventures, Lux Capital
2025-02-20 · Series B · $305M
2024-03-13 · Series A · $106M
2023-11-29 · Series A · $102.5M
Company data provided by Crunchbase