Software Engineer Graduate (Data Arch - Data Ecosystem) - 2026 Start (BS/MS) jobs in United States

TikTok · 2 weeks ago

Software Engineer Graduate (Data Arch - Data Ecosystem) - 2026 Start (BS/MS)

TikTok, the leading destination for short-form mobile video, is seeking talented individuals to join its Data Ecosystem Team. The role involves designing and implementing data architecture for large-scale recommendation systems and ensuring system reliability and performance for TikTok's extensive user base.

Content Creators, Content Discovery, Media and Entertainment, Social Media, Video
H1B Sponsor Likely

Responsibilities

Design and implement real-time and offline data architecture for large-scale recommendation systems
Build scalable and high-performance streaming Lakehouse systems that power feature pipelines, model training, and real-time inference
Collaborate with ML platform teams to support PyTorch-based model training workflows and design efficient data formats and access patterns for large-scale samples and features
Own core components of our distributed storage and processing stack, from file format to stream compaction to metadata management

Qualifications

Distributed systems, Lakehouse technologies, Java/Scala/C++, Apache Flink, PyTorch integration, Columnar file formats, Debugging, Performance tuning, Metadata management, Data versioning

Required

Bachelor's degree or above (or expected by 2026) in Computer Science or related technical field
Experience building large-scale distributed systems, preferably in storage, stream processing, or ML infrastructure
Familiarity with modern Lakehouse technologies such as Apache Paimon, Iceberg, Delta Lake, or Hudi, especially around incremental ingestion, schema evolution, and snapshot isolation

Preferred

Understanding of Apache Flink internals, with hands-on experience in state management, connectors, or UDFs
Experience in designing and optimizing Flink + Paimon architectures for unified batch/stream processing
Familiarity with feature storage and training data pipelines, and their integration with PyTorch, especially for large-scale model training
Knowledge of columnar file formats (Parquet, ORC, Lance) and how they are used in feature engineering or ML data loading
Proficiency in Java/Scala/C++, and strong debugging/performance tuning ability
Previous experience in Lakehouse metadata management, compaction scheduling, or data versioning is a plus
Knowledge of legacy data stores like HBase/Kudu is a bonus but not required

Benefits

Medical, dental, and vision insurance
401(k) savings plan with company match
Paid parental leave
Short-term and long-term disability coverage
Life insurance
Wellbeing benefits
10 paid holidays per year
10 paid sick days per year
17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure)

Company

TikTok is a short-form video entertainment app and social network platform. It is a sub-organization of ByteDance.

H1B Sponsorship

TikTok has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
[Chart: Distribution of Different Job Fields Receiving Sponsorship]
Trends of Total Sponsorships
2025 (979)
2024 (601)
2023 (387)
2022 (322)
2021 (133)
2020 (72)

Funding

Current Stage: Late Stage

Leadership Team

N Ali Mohamed, CEO
Blake Chandlee, VP Global Business Solutions
Company data provided by Crunchbase