Large Model Inference Acceleration Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

TikTok ยท 2 months ago

Large Model Inference Acceleration Engineer

TikTok is the leading destination for short-form mobile video, and they are seeking an experienced Large Model Inference Acceleration Engineer to enhance the performance and scalability of large-scale generative AI models. The role involves designing and optimizing inference pipelines and collaborating with engineers to ensure seamless model integration into production environments.

Content CreatorsContent DiscoveryMedia and EntertainmentSocial MediaVideo
check
H1B Sponsor Likelynote

Responsibilities

Design and optimize large model inference pipelines for low-latency, high-throughput deployments across diverse hardware architectures through high-performance optimization technologies
Benchmark and profile deep learning models to identify performance bottlenecks and optimize computational resources
Collaborate with production engineers and infrastructure teams to ensure seamless integration of optimized models into production environments

Qualification

AI model optimizationPythonC++CUDAML compilersParallel computingTensorRTTritonCutlassTransformersDiffusion modelsDeep learningHigh-performance optimizationBenchmarkingProfilingCollaboration

Required

Master's or PhD in Computer Science, Electrical Engineering, Artificial Intelligence, or a related field
Strong software engineering skills, including proficiency in Python, C++, and CUDA
5+ years of experience in AI model inference optimization
Experience working with ML compilers, parallel computing optimization, graph fusion, CUDA kernel development and TensorRT/Triton/Cutlass for model inference acceleration
Knowledge of transformers and diffusion models

Benefits

Medical, dental, and vision insurance
A 401(k) savings plan with company match
Paid parental leave
Short-term and long-term disability coverage
Life insurance
Wellbeing benefits
10 paid holidays per year
10 paid sick days per year
17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure)

Company

TikTok is a short-form video entertainment app and social network platform. It is a sub-organization of ByteDance.

H1B Sponsorship

TikTok has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (979)
2024 (601)
2023 (387)
2022 (322)
2021 (133)
2020 (72)

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
N Ali Mohamed
CEO
linkedin
leader-logo
Blake Chandlee
VP Global Business Solutions
linkedin
Company data provided by crunchbase