Google · 1 day ago
Staff Software Engineer, TPU Performance
Google is a leading technology company that develops next-generation technologies that change how billions of users connect and interact with information. The Staff Software Engineer will work on optimizing the performance of Machine Learning models on TPU systems and will engage with product teams to solve performance problems for new ML models and products.
Artificial Intelligence (AI)Cloud ComputingAppsMarketingCloud StorageSearch EngineSEO
Responsibilities
Identify and maintain ML training and serving benchmarks that are representative to Google production and broader ML industry
Achieve performance for customer launches, and in case of Third-Party/Open-Source Software (3P/OSS) models, for engaged benchmark submissions ML Commons, InferenceMAX, et cetera)
Use the benchmarks to identify performance opportunities and drive out-of-the-box performance toward improving the compiler, runtime, etc in collaboration with those teams
Engage with Google product teams and researchers to solve their performance problems (e.g., onboard new ML models and products on Google new TPU hardware, enabling larger models (giant models) to train efficiently on a very large-scale (that is, thousands of TPUs.))
Analyze performance and efficiency metrics to identify bottlenecks, design, and implement solutions at Google fleet-wide scale
Qualification
Required
Bachelor's degree or equivalent practical experience
8 years of experience in software development
5 years of experience testing, and launching software products, and 3 years of experience with software design and architecture
5 years of experience with one or more of the following: Speech/audio (e.g., technology duplicating and responding to the human voice), reinforcement learning (e.g., sequential decision making), ML infrastructure, or specialization in another ML field
5 years of experience with ML design and ML infrastructure (e.g., model deployment, model evaluation, data processing, debugging, fine tuning)
Preferred
Master's degree or PhD in Engineering, Computer Science, or a related technical field
8 years of experience with data structures and algorithms
Experience with machine learning, compiler optimization, code generation, and runtime systems for GPU architectures (OpenXLA, MLIR, Triton, etc)
Experience in tailoring algorithms and ML models to exploit ML accelerator architecture strengths and minimize weaknesses
Experience in low-level GPU programming (CUDA, OpenCL, etc.) and performance tuning techniques
Understanding of modern A Graphics Processing Unit (GPU), TPU or other ML accelerator architectures, memory hierarchies, and performance bottlenecks
Benefits
Bonus
Equity
Benefits
Company
Google specializes in internet-related services and products, including search, advertising, and software. It is a sub-organization of Alphabet.
H1B Sponsorship
Google has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (8763)
2024 (8872)
2023 (9682)
2022 (11626)
2021 (9109)
2020 (9785)
Funding
Current Stage
Public CompanyTotal Funding
$26.1MKey Investors
Kleiner Perkins,Sequoia CapitalAndy Bechtolsheim
2004-08-19IPO
1999-06-07Series Unknown· $25M
1998-11-01Angel· $1M
Recent News
Small Business Trends
2026-01-24
2026-01-24
Search Engine Journal
2026-01-24
Company data provided by crunchbase