Apply on Employer Site

SID.ai · 3 months ago

Research Intern (Summer 2026)

San Francisco, CA

Internship

Onsite

Intern

Start in 2026 Summer

SID.ai is a startup focused on training AI to retrieve and reason over various data sources, aiming to enhance AI's capabilities beyond internet data. They are seeking a Research Intern for Summer 2026 to work on post-training reasoning, design RL training environments, and conduct model experiments.

Data IntegrationData ManagementData StorageDatabaseGenerative AIInformation TechnologyInfrastructureInternetNatural Language ProcessingSoftware

Responsibilities

Post-train reasoning into LLMs with GRPO and SFT

Design and iterate RL training environments for retrieval – unstructured, structured, web

Run small and large model experiments – yolo runs encouraged

Work on next-generation vision-first embedding models

Qualification

RL pipelines for language modelsTorchrun/accelerate/multi-node trainingPyTorchCUDAArticulating ideas

Required

Not afraid of formulas – a technical major is an indicator of this (but isn't the only one)

Thinks they can learn anything in 2 weeks, but isn't arrogant about it

Prefers .py to .tex

Familiar with RL pipelines for language models

Comfortable with torchrun/accelerate/multi-node training

Clever about getting the data needed – or synthetically generating it

Finds easy solutions to hard problems, but doesn't mind getting their hands dirty, i.e., jumping a layer down into PyTorch or CUDA

Familiar with 'You and Your Research.' Understands what it takes to do significant work

Must articulate ideas well! A big part of making successful models is telling people about them. This includes writing docs and technical reports at the minimum – and jumping on podcasts at the extreme