Speech Scientist Intern jobs in United States
cer-icon
Apply on Employer Site
company-logo

Zoom · 13 hours ago

Speech Scientist Intern

Zoom is a company focused on enhancing communication and collaboration through innovative technology. They are seeking a Research Scientist Intern to develop advanced speech understanding models and collaborate with cross-functional teams to deliver impactful projects in speech technology.

CollaborationInformation TechnologyMessagingSaaSVideo Conferencing
check
H1B Sponsor Likelynote

Responsibilities

Developing state-of-the-art speech understanding models on large-scale datasets for Zoom products, including ASR, TTS, voice agents, speech-to-speech translation, and speech LLMs
Devising novel techniques where off-the-shelf solutions are not available
Demonstrating technical judgment in model prototyping, training, optimization, and evaluation
Collaborating with cross-functional teams, including products and science engineering teams, to deliver high-impact projects
Contributing to research publications and technical presentations

Qualification

Speech recognitionSpeech synthesisSpeech processingDeep learningPythonML frameworksNatural language processingCollaboration skillsCommunication skills

Required

Currently pursuing a PhD in Computer Science, Electrical Engineering or related fields
Display knowledge in deep learning and hands-on programming skills in Python, shell scripts; have familiarity with ML frameworks such as PyTorch and TensorFlow
Demonstrate experience in speech recognition, speech synthesis, speech processing, natural language processing or related fields in academic research
Have domain expertise in one or more of the following areas: modern end-to-end ASR architectures, TTS and voice cloning, voice agents and conversational AI, speech-to-speech translation, speech LLMs, language modeling, decoding algorithms, personalization and adaptation, semi-/self-supervised learning, multilingual and robust systems, LLM-integrative speech models
Have experience with speech toolkits and libraries such as Kaldi/k2, ESPNet, NeMo, TorchAudio, SpeechBrain or similar frameworks is a plus
Have experience with large scale data processing and model training
Demonstrate strong collaboration and communication skills

Benefits

Variety of perks, benefits, and options to help employees maintain their physical, mental, emotional, and financial health
Support work-life balance
Contribute to their community in meaningful ways

Company

Zoom

twittertwittertwitter
company-logo
Zoom is a software company that offers a communications platform that connects people through video, voice, chat, and content sharing.

H1B Sponsorship

Zoom has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (16)
2024 (178)
2023 (144)
2022 (259)
2021 (86)
2020 (34)

Funding

Current Stage
Public Company
Total Funding
$276M
Key Investors
ARK Investment ManagementSequoia CapitalEmergence Capital
2021-11-04Post Ipo Equity· $130M
2019-04-19Post Ipo Equity
2019-04-18IPO

Leadership Team

leader-logo
Eric Yuan
Founder & CEO
linkedin
leader-logo
Xuedong Huang
Chief Technology Officer
linkedin
Company data provided by crunchbase