Software Engineer Intern (Data Ecosystem) - 2026 Summer (BS/MS) jobs in United States
cer-icon
Apply on Employer Site
company-logo

TikTok · 22 hours ago

Software Engineer Intern (Data Ecosystem) - 2026 Summer (BS/MS)

TikTok is the leading destination for short-form mobile video, and they are seeking a Software Engineer Intern for their Data Ecosystem Team. The role involves designing and implementing data architecture for large-scale recommendation systems, ensuring system reliability and performance for TikTok's vast user base.

Content CreatorsContent DiscoveryMedia and EntertainmentSocial MediaVideo
badNo H1Bnote

Responsibilities

Design and implement real-time and offline data architecture for large-scale recommendation systems
Build scalable and high-performance streaming Lakehouse systems that power feature pipelines, model training, and real-time inference
Collaborate with ML platform teams to support PyTorch-based model training workflows and design efficient data formats and access patterns for large-scale samples and features
Own core components of our distributed storage and processing stack, from file format to stream compaction to metadata management

Qualification

Lakehouse technologiesFlink + Paimon architecturesJava/Scala/C++Apache Flink internalsColumnar file formatsPyTorch integrationDebugging skillsPerformance tuningData versioning

Required

Currently pursuing an Undergraduate/Master in Computer Science or a related technical discipline
Able to commit to working for 12 weeks during Summer 2026
Must obtain work authorization in country of employment at the time of hire, and maintain ongoing work authorization during employment
Familiarity with modern Lakehouse technologies such as Apache Paimon, Iceberg, Delta Lake, or Hudi, especially around incremental ingestion, schema evolution, and snapshot isolation

Preferred

Experience in designing and optimizing Flink + Paimon architectures for unified batch/stream processing
Familiarity with feature storage and training data pipelines, and their integration with PyTorch, especially for large-scale model training
Knowledge of columnar file formats (Parquet, ORC, Lance) and how they are used in feature engineering or ML data loading
Proficiency in Java/Scala/C++, and strong debugging/performance tuning ability
Previous experience in Lakehouse metadata management, compaction scheduling, or data versioning is a plus
Solid understanding of Apache Flink internals, with hands-on experience in state management, connectors, or UDFs

Benefits

Interns have day one access to health insurance, life insurance, wellbeing benefits and more.
Interns also receive 10 paid holidays per year and paid sick time (56 hours if hired in first half of year, 40 if hired in second half of year).
Interns who are not working 100% remote may also be eligible for housing allowance.

Company

TikTok is a short-form video entertainment app and social network platform. It is a sub-organization of ByteDance.

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
N Ali Mohamed
CEO
linkedin
leader-logo
Blake Chandlee
VP Global Business Solutions
linkedin
Company data provided by crunchbase