Hippocratic AI · 1 day ago

Audio Data Engineer – Speech Cleaning & Pipeline Automation (TTS)

Palo Alto, CA

Full-time

Onsite

Mid Level

Hippocratic AI is the leading generative AI company in healthcare, focused on transforming patient outcomes with a safety-first approach. The Audio Data Engineer will scale and improve speech datasets for Text-to-Speech (TTS) and speech synthesis systems, enhancing audio quality and building automated pipelines for audio processing.

Artificial Intelligence (AI), Foundational AI, Generative AI, Health Care, Information Technology

Growth Opportunities

H1B Sponsor Likely

Responsibilities

Clean, denoise, and enhance large volumes of recorded speech data for use in TTS and voice synthesis pipelines

Build and maintain automated audio preprocessing pipelines using scripting tools and open-source libraries

Apply techniques such as background noise removal, silence trimming, gain normalization, and sample rate conversion

Integrate tools like ffmpeg, sox, or Python-based scripts (pydub, torchaudio, librosa) into scalable workflows

Collaborate with ML researchers and speech scientists to deliver high-quality, ready-to-train datasets

Evaluate audio quality using perceptual and quantitative metrics, and maintain audio QA checklists

Qualifications

Speech/audio cleaning, Python, Automated workflows, Digital audio principles, TTS model pipelines, Audio engineering, Signal processing, Cloud platforms, Attention to detail

Required

Strong experience with speech/audio cleaning using tools such as iZotope RX, Audacity, Adobe Audition, or SoX

Proficiency in Python and audio-related scripting for automation and batch processing

Familiarity with digital audio principles, including sample rates, bit depth, frequency bands, and compression artifacts

Experience designing or operating scalable, automated workflows for handling audio at volume

Meticulous attention to detail in audio quality control and error spotting

Preferred

Experience working on TTS model pipelines (e.g., Tacotron, VITS, FastSpeech) or speech synthesis datasets

Background in audio engineering, phonetics, or signal processing

Familiarity with real-time or low-latency audio processing constraints

Experience with cloud platforms and tools for automation (e.g., AWS, Airflow, or containerized audio workflows)

Company

Hippocratic AI

Hippocratic AI is a healthcare technology company that develops safety-focused large language models for medical applications.

Founded in 2023

Palo Alto, California, USA

51-200 employees

https://www.hippocraticai.com

H1B Sponsorship

Hippocratic AI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)

Trends of Total Sponsorships

2025 (9)

2024 (1)

Funding

Current Stage

Growth Stage

Total Funding

$402M

Key Investors

Avenir, Kleiner Perkins, NVentures

2025-11-03 · Series C · $126M

2025-01-09 · Series B · $141M

2024-09-19 · Series A · $17M

Leadership Team

Alex Miller

Co-Founder

Amy McCarthy

Chief Nursing Officer

Recent News

Morningstar.com

BCG and Hippocratic AI Announce Strategic Collaboration to Deploy Agentic AI Across Biopharma and Medtech

2026-01-11

Mobihealthnews

Anthropic, Genmab partner to use Claude for R&D and more digital health news

2026-01-11

Business Wire

Hippocratic AI and Huron Consulting Group Announce Strategic Collaboration to Transform Healthcare Delivery and Innovation

2026-01-09

Company data provided by Crunchbase