Audio Data Engineer – Speech Cleaning & Pipeline Automation (TTS) jobs in United States
cer-icon
Apply on Employer Site
company-logo

Hippocratic AI · 1 day ago

Audio Data Engineer – Speech Cleaning & Pipeline Automation (TTS)

Hippocratic AI is the leading generative AI company in healthcare, focused on transforming patient outcomes with a safety-first approach. The Audio Data Engineer will scale and improve speech datasets for Text-to-Speech (TTS) and speech synthesis systems, enhancing audio quality and building automation pipelines for processing.

Artificial Intelligence (AI)Foundational AIGenerative AIHealth CareInformation Technology
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Clean, denoise, and enhance large volumes of recorded speech data for use in TTS and voice synthesis pipelines
Build and maintain automated audio preprocessing pipelines using scripting tools and open-source libraries
Apply techniques such as background noise removal, silence trimming, gain normalization, and sample rate conversion
Integrate tools like ffmpeg, sox, or Python-based scripts (pydub, torchaudio, librosa) into scalable workflows
Collaborate with ML researchers and speech scientists to deliver high-quality, ready-to-train datasets
Evaluate audio quality using perceptual and quantitative metrics, and maintain audio QA checklists

Qualification

Speech/audio cleaningPythonAutomated workflowsDigital audio principlesTTS model pipelinesAudio engineeringSignal processingCloud platformsAttention to detail

Required

Strong experience with speech/audio cleaning using tools such as iZotope RX, Audacity, Adobe Audition, or SoX
Proficiency in Python and audio-related scripting for automation and batch processing
Familiarity with digital audio principles, including sample rates, bit depth, frequency bands, and compression artifacts
Experience designing or operating scalable, automated workflows for handling audio at volume
Meticulous attention to detail in audio quality control and error spotting

Preferred

Experience working on TTS model pipelines (e.g., Tacotron, VITS, FastSpeech) or speech synthesis datasets
Background in audio engineering, phonetics, or signal processing
Familiarity with real-time or low-latency audio processing constraints
Experience with cloud platforms and tools for automation (e.g., AWS, Airflow, or containerized audio workflows)

Company

Hippocratic AI

twittertwittertwitter
company-logo
Hippocratic AI is a healthcare technology company that develops safety-focused large-language models for medical applications.

H1B Sponsorship

Hippocratic AI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (9)
2024 (1)

Funding

Current Stage
Growth Stage
Total Funding
$402M
Key Investors
AvenirKleiner PerkinsNVentures
2025-11-03Series C· $126M
2025-01-09Series B· $141M
2024-09-19Series A· $17M

Leadership Team

leader-logo
Alex Miller
Co-Founder
linkedin
leader-logo
Amy McCarthy
Chief Nursing Officer
linkedin
Company data provided by crunchbase