Hippocratic AI · 21 hours ago
Audio Data Engineer – Speech Cleaning & Pipeline Automation (TTS)
Hippocratic AI is the leading generative AI company in healthcare, focused on transforming patient outcomes with a safety-first approach. The Audio Data Engineer will scale and improve speech datasets for Text-to-Speech (TTS) and speech synthesis systems, enhancing audio quality and building automation pipelines for processing.
Artificial Intelligence (AI) · Foundational AI · Generative AI · Health Care · Information Technology
Responsibilities
Clean, denoise, and enhance large volumes of recorded speech data for use in TTS and voice synthesis pipelines
Build and maintain automated audio preprocessing pipelines using scripting tools and open-source libraries
Apply techniques such as background noise removal, silence trimming, gain normalization, and sample rate conversion
Integrate tools like ffmpeg, sox, or Python-based scripts (pydub, torchaudio, librosa) into scalable workflows
Collaborate with ML researchers and speech scientists to deliver high-quality, ready-to-train datasets
Evaluate audio quality using perceptual and quantitative metrics, and maintain audio QA checklists
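As a rough illustration of two of the techniques named above (silence trimming and gain normalization), here is a minimal sketch in plain Python. It is not taken from the posting or from any Hippocratic AI pipeline. The function names, the 0.01 amplitude threshold, and the -1 dBFS target are assumptions chosen for the example, and audio is modeled as a list of floats in the range [-1.0, 1.0]. A production workflow would more likely rely on tools such as ffmpeg, sox, or torchaudio.

```python
def trim_silence(samples, threshold=0.01):
    """Drop leading and trailing samples whose magnitude is below `threshold`.

    `threshold` is a hypothetical amplitude floor; real pipelines usually
    measure energy over short frames rather than single samples.
    """
    start, end = 0, len(samples)
    while start < end and abs(samples[start]) < threshold:
        start += 1
    while end > start and abs(samples[end - 1]) < threshold:
        end -= 1
    return samples[start:end]


def normalize_peak(samples, target_dbfs=-1.0):
    """Scale samples so the loudest one sits at `target_dbfs` (full scale = 1.0)."""
    peak = max((abs(s) for s in samples), default=0.0)
    if peak == 0.0:
        # Empty or all-zero input: nothing to scale.
        return list(samples)
    target_amplitude = 10 ** (target_dbfs / 20)  # convert dBFS to linear amplitude
    gain = target_amplitude / peak
    return [s * gain for s in samples]


if __name__ == "__main__":
    clip = [0.0, 0.001, 0.2, -0.5, 0.25, 0.002, 0.0]
    cleaned = normalize_peak(trim_silence(clip), target_dbfs=-6.0)
    print(len(cleaned), max(abs(s) for s in cleaned))
```

Peak normalization is used here only because it is the simplest form of gain normalization; loudness-based targets (for example LUFS) are a common alternative and need a perceptual weighting step that this sketch omits.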
Qualifications
Required
Strong experience with speech/audio cleaning using tools such as iZotope RX, Audacity, Adobe Audition, or SoX
Proficiency in Python and audio-related scripting for automation and batch processing
Familiarity with digital audio principles, including sample rates, bit depth, frequency bands, and compression artifacts
Experience designing or operating scalable, automated workflows for handling audio at volume
Meticulous attention to detail in audio quality control and error spotting
Preferred
Experience working on TTS model pipelines (e.g., Tacotron, VITS, FastSpeech) or speech synthesis datasets
Background in audio engineering, phonetics, or signal processing
Familiarity with real-time or low-latency audio processing constraints
Experience with cloud platforms and tools for automation (e.g., AWS, Airflow, or containerized audio workflows)
Company
Hippocratic AI
Hippocratic AI is a healthcare technology company that develops safety-focused large language models for medical applications.
H1B Sponsorship
Hippocratic AI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is presented below for your reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships: 2025 (9), 2024 (1)
Funding
Current Stage: Growth Stage
Total Funding: $402M
Key Investors: Avenir, Kleiner Perkins, NVentures
2025-11-03 · Series C · $126M
2025-01-09 · Series B · $141M
2024-09-19 · Series A · $17M
Company data provided by Crunchbase