Software Engineer Graduate (Data Arch - Data Ecosystem) - 2026 (PhD) jobs in United States

TikTok · 2 weeks ago

Software Engineer Graduate (Data Arch - Data Ecosystem) - 2026 (PhD)

TikTok is the leading destination for short-form mobile video. The Data Ecosystem Team is seeking a graduate to design and implement data architecture for large-scale recommendation systems, ensuring system reliability and performance while collaborating with machine learning teams.

Content Creators, Content Discovery, Media and Entertainment, Social Media, Video
H1B Sponsor Likely

Responsibilities

Design and implement real-time and offline data architecture for large-scale recommendation systems
Build scalable and high-performance streaming Lakehouse systems that power feature pipelines, model training, and real-time inference
Collaborate with ML platform teams to support PyTorch-based model training workflows and design efficient data formats and access patterns for large-scale samples and features
Own core components of our distributed storage and processing stack, from file format to stream compaction to metadata management

Qualifications

Distributed systems, Apache Flink, Lakehouse technologies, Java/Scala/C++, Feature storage, Debugging, Performance tuning, Data versioning, HBase/Kudu

Required

PhD or Master's degree in Computer Science or related technical field
Experience building large-scale distributed systems, preferably in storage, stream processing, or ML infrastructure
Solid understanding of Apache Flink internals, with hands-on experience in state management, connectors, or UDFs
Familiarity with modern Lakehouse technologies such as Apache Paimon, Iceberg, Delta Lake, or Hudi, especially around incremental ingestion, schema evolution, and snapshot isolation

Preferred

Experience in designing and optimizing Flink + Paimon architectures for unified batch/stream processing
Familiarity with feature storage and training data pipelines, and their integration with PyTorch, especially for large-scale model training
Knowledge of columnar file formats (Parquet, ORC, Lance) and how they are used in feature engineering or ML data loading
Proficiency in Java/Scala/C++, and strong debugging/performance tuning ability
Previous experience in Lakehouse metadata management, compaction scheduling, or data versioning is a plus
Knowledge of legacy data stores like HBase/Kudu is a bonus but not required

Benefits

Day-one access to medical, dental, and vision insurance
A 401(k) savings plan with company match
Paid parental leave
Short-term and long-term disability coverage
Life insurance
Wellbeing benefits
10 paid holidays per year
10 paid sick days per year
17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure)

Company

TikTok is a short-form video entertainment app and social network platform. It is a sub-organization of ByteDance.

H1B Sponsorship

TikTok has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data from the US Department of Labor)
Distribution of Job Fields Receiving Sponsorship (chart not shown)
Trends of Total Sponsorships
2025 (979)
2024 (601)
2023 (387)
2022 (322)
2021 (133)
2020 (72)

Funding

Current Stage
Late Stage

Leadership Team

N Ali Mohamed, CEO
Blake Chandlee, VP Global Business Solutions
Company data provided by Crunchbase