United Language Group · 2 months ago

Audio AI Engineer

United Language Group is dedicated to making communication accessible through AI-powered tools and multilingual services. They are seeking an Audio AI Engineer to develop and optimize systems for real-time speech-to-speech interpretation, focusing on integrating speech recognition, translation, and synthesis technologies.

Internet · Language Learning · Service Industry · Translation Service
H1B Sponsor Likely

Responsibilities

Design and optimize end-to-end Speech-to-Speech pipelines that integrate ASR, translation, and TTS with minimal latency
Build bidirectional interpretation systems that handle turn-taking, speaker identification, and context preservation across language boundaries
Collaborate with the Audio/Speech Engineer to optimize latency, quality, and robustness of speech components in the full pipeline
Work with the Staff ML Engineer to design efficient inference architectures and deployment strategies for real-time streaming systems
Develop streaming ASR and TTS systems capable of handling continuous, overlapping speech in interpretation scenarios
Benchmark and optimize latency across all pipeline stages (speech capture, recognition, translation, synthesis)
Integrate speaker diarization, acoustic environment adaptation, and speech enhancement into interpretation workflows
Partner with linguists and product teams to validate interpretation quality and gather domain-specific feedback

Qualifications

ASR · TTS · Streaming audio architectures · Python · Real-time signal processing · Speech processing · Low-latency production systems · Interpretation workflows · Multilingual challenges · Speech quality metrics

Required

Bachelor's or Master's Degree in Electrical Engineering, Computer Science, or related field
3+ years of experience in speech processing, audio engineering, or conversational AI systems
Deep expertise in ASR, TTS, and streaming audio architectures
Proficiency in Python, ML frameworks, and experience with real-time signal processing
Experience building low-latency production systems and optimizing for inference performance
Strong understanding of interpretation workflows, multilingual challenges, and speech quality metrics

Preferred

Experience building speech-to-text pipelines or hybrid ASR + LLM systems
Familiarity with real-time audio processing or latency-sensitive applications

Company

United Language Group

United Language Group is one of the largest translation and localization providers in the world.

H1B Sponsorship

United Language Group has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data from the US Department of Labor)
[Chart: Distribution of Different Job Fields Receiving Sponsorship]
Trends of Total Sponsorships: 2020 (1)

Funding

Current Stage: Late Stage
Total Funding: $1.8M
Key Investors: Centers for Medicare & Medicaid Services
2024-10-09 · Acquired
2024-09-17 · Grant · $1.8M
2016-06-06 · Private Equity

Leadership Team

Stephen Torgeson
Executive Vice President and Chief Technology Officer
Company data provided by Crunchbase