Student Researcher [Seed Vision – Multimodal Joint Modeling] – 2026 Start (PhD) jobs in United States
cer-icon
Apply on Employer Site
company-logo

ByteDance · 4 hours ago

Student Researcher [Seed Vision – Multimodal Joint Modeling] – 2026 Start (PhD)

ByteDance is a leading company in AI foundation models, focusing on advanced research and technological advancements. The role of Student Researcher involves conducting research on multimodal generative models and contributing to foundational models for visual generation.

ContentData MiningFoundational AIInternetSocial Media
check
Comp. & Benefits
check
H1B Sponsor Likelynote

Responsibilities

Conduct research on joint training of vision, language, and video models under a unified architecture
Develop scalable and efficient methods for autoregressive-style multimodal pretraining, supporting both understanding and generation
Explore cross-modal tokenization, alignment, and shared representation strategies
Investigate instruction tuning, captioning, and open-ended generation capabilities across modalities
Contribute to system-level improvements in data curation, model optimization, and evaluation pipelines

Qualification

PhD in Computer VisionResearch experience in multimodal learningPyTorchAutoregressive LLM trainingInstruction tuningBackground in model scalingIndependent research abilityPublications in top-tier conferences

Required

Currently pursuing a PhD in Computer Vision, Machine Learning, NLP, or a related field
Research experience in multimodal learning, large-scale pretraining, or vision-language modeling
Proficiency in deep learning frameworks such as PyTorch or JAX
Demonstrated ability to conduct independent research, with publications in top-tier conferences such as CVPR, ICCV, ECCV, NeurIPS, ICML, ICLR

Preferred

Experience with autoregressive LLM training, especially in multimodal or unified modeling settings
Familiarity with instruction tuning, vision-language generation, or unified token space design
Background in model scaling, efficient training, or data mixture strategies
Ability to work closely with infrastructure teams to deploy large-scale training workflows

Benefits

Day one access to health insurance
Life insurance
Wellbeing benefits
10 paid holidays per year
Paid sick time (56 hours if hired in first half of year, 40 if hired in second half of year)
Housing allowance

Company

ByteDance

company-logo
ByteDance is a technology company that develops content creation platforms and services.

H1B Sponsorship

ByteDance has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1350)
2024 (1123)
2023 (775)
2022 (487)
2021 (417)
2020 (245)

Funding

Current Stage
Late Stage
Total Funding
$9.8B
Key Investors
Capital TodayG42Tiger Global Management
2025-11-20Secondary Market· $300M
2024-07-25Secondary Market
2023-03-14Secondary Market· $100M

Leadership Team

leader-logo
Jochen Bischoff
Head of Global Business Solutions - Africa
linkedin
leader-logo
Matty Lin
General Manager, Global Business Solutions, KR
linkedin
Company data provided by crunchbase