Kalpa Labs (YC F25) · 3 weeks ago
Founding ML Research Engineer - Training Infrastructure
Kalpa Labs is a startup focused on developing advanced machine learning technologies. They are seeking a Founding ML Research Engineer to build the infrastructure for training large-scale speech models, managing the training stack from design to implementation.
Artificial Intelligence (AI) · Software · Speech Recognition
Responsibilities
Design and implement a production-grade training stack for large-scale speech model pre-training and post-training (SFT/RLHF-style, distillation, preference optimization, etc.)
Build scalable data + compute pipelines: dataset curation, filtering, mixing, tokenization/feature pipelines, evaluation harnesses, and experiment tracking
Own distributed training: performance profiling, stability, fault tolerance, checkpointing, resumption, and high-throughput I/O
Qualifications
Required
Strong ML systems and engineering depth (distributed training, performance, reliability)
Practical experience training large models (speech/audio is a plus but not required; language/vision experience is also relevant)
Comfort operating in ambiguity: you can spec, build, debug, and ship
Benefits
Will sponsor visas
Company
Kalpa Labs (YC F25)
We're building the next frontier of speech models that can pass the Turing test over long conversations, along with state-of-the-art infra for your voice agents.