TikTok ยท 3 days ago
Site Reliability Engineer Graduate (Compute Platform) - 2026 Start (BS/MS)
TikTok is the leading destination for short-form mobile video, and they are seeking talented individuals to join their Compute Platform SRE team in 2026. The role involves ensuring the reliability of major data warehouse products and services, optimizing performance, and collaborating with cross-functional teams.
Content CreatorsContent DiscoveryMedia and EntertainmentSocial MediaVideo
Responsibilities
Responsible for the reliability of all TikTok's major data warehouse products, services, and query engines, such as ClickHouse, Spark, Presto, Doris, etc
Uphold Service Level Agreements (SLAs): Ensure that all service level objectives and agreements from ByteDance's Data Platform services are met. Respond promptly to any system outages or issues
Continuous Performance Optimization: Analyze service performance and reliability patterns to identify potential performance bottlenecks. Implement proactive measures to prevent service disruptions. Work with development teams to optimize application performance, ensuring that services run efficiently and that resources are utilized effectively
Incident Management: Lead efforts to troubleshoot and resolve service incidents and postmortems. Coordinate with cross-functional teams to manage and mitigate service-impacting events
Infrastructure Automation: Automate infrastructure provisioning, scaling, and management processes to reduce manual interventions and improve service quality
Collaboration: Engage with product and development teams to integrate reliability and performance considerations into the software lifecycle
Capacity and Demand Planning: Assess and forecast infrastructure needs based on growth patterns and upcoming initiatives
Stay Updated: Keep current with industry trends, best practices, and emerging technologies related to site reliability and infrastructure engineering
Qualification
Required
Bachelor's Degree or above, in Computer Science, Engineering, or a related field
Passionate about computer science and Internet technology
In-depth understanding of Linux, computer networking, and databases
Proficient in common SRE/DevOps open-source toolsets, system monitoring tools, and container orchestration platforms like Kubernetes
Experience or familiarity with open-source or commercial technologies such as ClickHouse, Hadoop, Doris, Spark, Presto and Kubernetes
Strong coding skills in at least one scripting or programming language, including but not limited to Python, Shell, Java, Go, etc
Preferred
Excellent problem-solving skills and the ability to think critically under pressure
Strong customer-first mindset
Strong sense of ownership and easy to collaborate with
Benefits
Medical, dental, and vision insurance
401(k) savings plan with company match
Paid parental leave
Short-term and long-term disability coverage
Life insurance
Wellbeing benefits
10 paid holidays per year
10 paid sick days per year
17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure)
Company
TikTok
TikTok is a short-form video entertainment app and social network platform. It is a sub-organization of ByteDance.
H1B Sponsorship
TikTok has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (979)
2024 (601)
2023 (387)
2022 (322)
2021 (133)
2020 (72)
Funding
Current Stage
Late StageRecent News
2026-01-06
2026-01-06
2026-01-06
Company data provided by crunchbase