Research Scientist Graduate (Foundation Model-Speech-Multimodal Interactions) - 2026 Start (PhD) jobs in United States

ByteDance · 3 hours ago

Research Scientist Graduate (Foundation Model-Speech-Multimodal Interactions) - 2026 Start (PhD)

ByteDance is a pioneering company focused on advanced AI foundation models. The Seed-Speech Team is seeking a Research Scientist Graduate to conduct research and development in speech/audio foundation models and collaborate with cross-functional teams to integrate findings into practical applications.

Content, Data Mining, Foundational AI, Internet, Social Media
Comp. & Benefits
H1B Sponsor Likely

Responsibilities

Conduct cutting-edge research and development in speech/audio foundation models
Collaborate with cross-functional teams to identify key research areas and contribute to the development of innovative speech/audio models
Work with product development teams to integrate research findings into practical applications for ByteDance and other platforms
Collaborate on team-driven projects to address complex challenges and enhance the overall effectiveness of the research team

Qualifications

Machine Learning, Deep Learning, Speech Recognition, Large Language Models, Python, C/C++, TensorFlow, PyTorch, Distributed Computing, Algorithms, Collaboration

Required

Master's or PhD in computer science, mathematics, engineering or related field
Experience in one or more areas of machine learning and deep learning, including but not limited to: Full-Duplex Speech Models, Speech Language Models, Automatic Speech Recognition, Automatic Speech Translation, Speech/audio self-supervised learning and foundation models, Speaker recognition and verification, Speech emotion recognition, Multimodal foundation models, Large Language Model pretraining and finetuning

Preferred

Publications in top-tier ML/DL venues such as NeurIPS, ICLR, ICML, AAAI and speech venues such as ICASSP, ASRU, Interspeech
Deep understanding of large language models
Familiar with distributed computing and large-scale model training
Familiar with deep learning frameworks such as TensorFlow and PyTorch
Familiar with engineering principles and best practices
Highly competent in algorithms and programming; Strong coding skills in C/C++ and Python
Ability to work collaboratively in a fast-paced, multi-functional environment

Benefits

Employees have day one access to medical, dental, and vision insurance
A 401(k) savings plan with company match
Paid parental leave
Short-term and long-term disability coverage
Life insurance
Wellbeing benefits
10 paid holidays per year
10 paid sick days per year
17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure)

Company

ByteDance

ByteDance is a technology company that develops content creation platforms and services.

H1B Sponsorship

ByteDance has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
[Chart: Distribution of Different Job Fields Receiving Sponsorship]
Trends of Total Sponsorships
2025 (1350)
2024 (1123)
2023 (775)
2022 (487)
2021 (417)
2020 (245)

Funding

Current Stage
Late Stage
Total Funding
$9.8B
Key Investors
Capital Today, G42, Tiger Global Management
2025-11-20 · Secondary Market · $300M
2024-07-25 · Secondary Market
2023-03-14 · Secondary Market · $100M

Leadership Team

Jochen Bischoff
Head of Global Business Solutions - Africa
Matty Lin
General Manager, Global Business Solutions, KR
Company data provided by Crunchbase