Research Internship- Multimodal LLM (Speech/Music/Audio/Vision/Language) jobs in United States

Tencent · 2 months ago

Research Internship- Multimodal LLM (Speech/Music/Audio/Vision/Language)

Tencent is a leading internet company in China, and its AI Lab in the Seattle area focuses on advancing AI technologies. The lab is seeking research interns to develop novel multimodal processing techniques and contribute to projects aimed at solving core AI challenges.

Advertising, Internet, Online Games, Online Portals, Social Media Marketing
Growth Opportunities
H1B Sponsor Likely

Responsibilities

Work with researchers on a project that tackles one of the core problems in AI by inventing cutting-edge techniques
Encourage discussions and collaborations between researchers and interns
Publish the results from the internship
Develop more effective multimodal pretraining and post-training strategies for audio, speech, music, image, and video understanding and generation
Enable full-duplex conversations
Design more efficient large-model architectures
Enhance multimodal memory and reasoning capabilities
Advance novel audio, speech, music, image, and video processing techniques—such as encoding, tokenization, and representation learning—with a focus on multimodal applications and end-to-end large models

Qualifications

Natural Language Processing, Speech Processing, Audio Processing, Computer Vision, Machine Learning, Python, C++, Deep Learning Toolkits, Research Experience, Publication Track Record, Intellectual Flexibility, Creativity, Self-Motivated

Required

Ph.D. student in computer science, electrical engineering, mathematics, or a related field
Self-motivated and excited about developing novel techniques
Research experience in natural language processing, speech, audio, and music processing, computer vision, dialog systems, or machine learning
Good publication track record and a history of creativity and intellectual flexibility
Skilled at programming in Python and/or C++, with experience using one of the leading deep learning toolkits

Benefits

1 hour of paid sick leave for every 30 hours worked
Up to 13 paid holidays throughout the calendar year
Eligible to enroll in the Company-sponsored medical plan

Company

Tencent is an internet service portal offering value-added internet, mobile, telecom, and online advertising services.

H1B Sponsorship

Tencent has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is presented below for reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2025 (3)
2024 (11)
2023 (2)
2022 (2)

Funding

Current Stage
Public Company
Total Funding
$13.84B
Key Investors
Lippo Group
2025-09-16 · Post-IPO Debt · $1.27B
2020-05-29 · Post-IPO Debt · $6B
2019-08-29 · Post-IPO Debt · $6.5B

Leadership Team

Dowson Tong
CEO of Tencent Cloud and Smart Industries Group (CSIG)
James Mitchell
Chief Strategy Officer and Senior Executive Vice President
Company data provided by crunchbase