Apply on Employer Site

Kalpa Labs (YC F25) · 3 weeks ago

Founding ML Research Engineer - Training Infrastructure

San Francisco, CA, US

Full-time

Onsite

Entry, Mid Level

$140K/yr - $200K/yr

Kalpa Labs is a startup focused on developing advanced machine learning technologies. They are seeking a Founding ML Research Engineer to build the infrastructure for training large-scale speech models, managing the training stack from design to implementation.

Artificial Intelligence (AI)SoftwareSpeech Recognition

H1B Sponsored

Responsibilities

Design and implement a production-grade training stack for large-scale speech model pre-training and post-training (SFT/RLHF-style, distillation, preference optimization, etc.)

Build scalable data + compute pipelines: dataset curation, filtering, mixing, tokenization/feature pipelines, evaluation harnesses, and experiment tracking

Own distributed training: performance profiling, stability, fault tolerance, checkpointing, resumption, and high-throughput I/O

Qualification

ML systems engineeringDistributed trainingLarge model trainingSpeech model experienceDebuggingExperiment trackingData pipeline building

Required

Strong ML systems and engineering depth (distributed training, performance, reliability)

Practical experience training large models (speech/audio is a plus but not required; language/vision experience is also relevant)

Comfort operating in ambiguity: you can spec, build, debug, and ship

Benefits

Will sponsor

Company

Kalpa Labs (YC F25)

We're building the next frontier of speech models that can pass the turing test over long conversations along with state-of-the-art infra for your voice agents.

Founded in 2025