SIGN IN
Staff Software Engineer, TPU Performance jobs in United States
cer-icon
Apply on Employer Site
company-logo

Google · 1 day ago

Staff Software Engineer, TPU Performance

Google is a leading technology company that develops next-generation technologies that change how billions of users connect and interact with information. The Staff Software Engineer will work on optimizing the performance of Machine Learning models on TPU systems and will engage with product teams to solve performance problems for new ML models and products.
Artificial Intelligence (AI)Cloud ComputingAppsMarketingCloud StorageSearch EngineSEO
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Identify and maintain ML training and serving benchmarks that are representative to Google production and broader ML industry
Achieve performance for customer launches, and in case of Third-Party/Open-Source Software (3P/OSS) models, for engaged benchmark submissions ML Commons, InferenceMAX, et cetera)
Use the benchmarks to identify performance opportunities and drive out-of-the-box performance toward improving the compiler, runtime, etc in collaboration with those teams
Engage with Google product teams and researchers to solve their performance problems (e.g., onboard new ML models and products on Google new TPU hardware, enabling larger models (giant models) to train efficiently on a very large-scale (that is, thousands of TPUs.))
Analyze performance and efficiency metrics to identify bottlenecks, design, and implement solutions at Google fleet-wide scale

Qualification

Machine LearningGPU ProgrammingSoftware DesignPerformance TuningData StructuresAlgorithmsML InfrastructureReinforcement LearningSoft Skills

Required

Bachelor's degree or equivalent practical experience
8 years of experience in software development
5 years of experience testing, and launching software products, and 3 years of experience with software design and architecture
5 years of experience with one or more of the following: Speech/audio (e.g., technology duplicating and responding to the human voice), reinforcement learning (e.g., sequential decision making), ML infrastructure, or specialization in another ML field
5 years of experience with ML design and ML infrastructure (e.g., model deployment, model evaluation, data processing, debugging, fine tuning)

Preferred

Master's degree or PhD in Engineering, Computer Science, or a related technical field
8 years of experience with data structures and algorithms
Experience with machine learning, compiler optimization, code generation, and runtime systems for GPU architectures (OpenXLA, MLIR, Triton, etc)
Experience in tailoring algorithms and ML models to exploit ML accelerator architecture strengths and minimize weaknesses
Experience in low-level GPU programming (CUDA, OpenCL, etc.) and performance tuning techniques
Understanding of modern A Graphics Processing Unit (GPU), TPU or other ML accelerator architectures, memory hierarchies, and performance bottlenecks

Benefits

Bonus
Equity
Benefits

Company

Google specializes in internet-related services and products, including search, advertising, and software. It is a sub-organization of Alphabet.

H1B Sponsorship

Google has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (8763)
2024 (8872)
2023 (9682)
2022 (11626)
2021 (9109)
2020 (9785)

Funding

Current Stage
Public Company
Total Funding
$26.1M
Key Investors
Kleiner Perkins,Sequoia CapitalAndy Bechtolsheim
2004-08-19IPO
1999-06-07Series Unknown· $25M
1998-11-01Angel· $1M

Leadership Team

leader-logo
Sundar Pichai
CEO
linkedin
leader-logo
Thomas Kurian
CEO - Google Cloud
linkedin
Company data provided by crunchbase