Speech Data Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Aiphoria · 4 days ago

Speech Data Engineer

Aiphoria is a company specializing in AI-driven products, and they are seeking a Speech Data Engineer to bridge the data market with technological needs. The role involves identifying unique data sources, evaluating their quality, and collaborating closely with internal teams to ensure dataset specifications align with model training needs.

Artificial Intelligence (AI)Enterprise ApplicationsEnterprise Software

Responsibilities

Collect and prepare speech datasets (ASR/TTS) across multiple languages when customer data is unavailable
Process raw audio data, including speech segmentation, speaker separation, and basic preprocessing
Run speech recognition and pseudo-labeling, and collaborate with crowdsourcing/labeling platforms to improve data quality
Understand and apply differences between ASR data (noisy, real-world speech) and TTS data (clean, high-quality recordings)
Organize, version, and maintain speech datasets, ensuring teams always know what data exists and where it lives
Support existing data infrastructure and pipelines (e.g. DVC)
Work with external data providers, evaluating dataset quality and contributing to make-vs-buy decisions

Qualification

Speech data processingLabeling toolsQuality assessment metricsMultilingual ASR/TTSAudio data validationData curationCollaboration skills

Required

Hands-on experience with speech data processing and labeling tools, such as VAD, Pyannote, whisper, and other segmentation or diarization frameworks
Familiarity with quality assessment metrics, including SNR (Signal-to-Noise Ratio) and other acoustic analysis indicators
Collect, process, and curate speech datasets, including audio recordings, transcripts, and metadata for multilingual ASR and TTS applications
Work closely with internal ASR/TTS development teams to align dataset specifications with model training needs
Label and validate audio data, ensuring transcription accuracy, speaker diversity, and consistent metadata standards

Benefits

Remote work opportunities
Competitive compensation surpassing market standards
A company with entrepreneurial spirit

Company

Aiphoria

twittertwitter
company-logo
Aiphoria is a technology company that provides AI-powered virtual employees and automation solutions for enterprise operations.

Funding

Current Stage
Growth Stage
Total Funding
$34M
Key Investors
Ratmir Timashev
2025-07-21Series A· $34M
Company data provided by crunchbase