Speech Software Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

ASAPP · 6 hours ago

Speech Software Engineer

ASAPP is a company focused on delivering AI-powered customer experiences. They are seeking a Speech Software Engineer to lead the architectural evolution of their voice infrastructure, focusing on building scalable, high-performance systems for real-time customer interactions.

Artificial Intelligence (AI)CRMCustomer ServiceEnterprise SoftwareSales Automation
check
H1B Sponsor Likelynote

Responsibilities

Architect & Modernize: Lead the design and implementation of a scalable, high-availability voice infrastructure that replaces legacy systems
Optimize Performance: Build and refine multi-threaded server frameworks capable of handling thousands of concurrent, real-time audio streams with minimal jitter and latency
Build for Scale: Deploy robust ASR > LLM > TTS pipelines that process thousands of calls concurrently
Stream Engineering: Develop robust logic for handling media streams, ensuring seamless audio data flow between clients and our ML models
System Observability: Build advanced monitoring and load-testing tools specifically designed to simulate high-concurrency voice traffic
Collaborate: Partner with Speech Scientists and Research Engineers to integrate state-of-the-art models into a production-ready environment

Qualification

ASR/TTS productsGolangPythonAudio processingKubernetesDockerCloud providersEvent-driven architectureBig DataObject-oriented designGrowth mindset

Required

5+ years of software engineering experience, with a proven track record of building and maintaining production-grade infrastructure
A background in building ASR/TTS products at scale that interact with foundational LLMs
Expert-level proficiency in Golang, Python, or willingness to learn
Deep understanding of audio processing, including sample rates, codecs (Opus, G.711), network protocols, and buffering strategies
Strong background in object-oriented design and the ability to architect systems that are both modular and performant
The ability to navigate and refactor large existing codebases while transitioning to new, more efficient architectures

Preferred

Hands-on experience with Kubernetes, Docker, and cloud providers (AWS/GCP/Azure) for deploying distributed speech services
Familiarity with event loops (Boost.Asio, uvloop) and asynchronous programming patterns
Experience with Hadoop, Spark, or Hive for analyzing massive datasets of speech logs to improve model accuracy

Company

Breakthroughs are born out of research and ASAPP is advancing AI to drive greater human productivity and automating the world's workflows

H1B Sponsorship

ASAPP has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (5)
2024 (3)
2023 (2)
2022 (7)
2021 (14)
2020 (12)

Funding

Current Stage
Growth Stage
Total Funding
$380M
2023-06-01Secondary Market
2021-05-19Series C· $120M
2020-05-01Series B· $185M

Leadership Team

leader-logo
Michael Lawder
Executive Advisor
linkedin
Company data provided by crunchbase