SIGN IN
Staff Software Engineer, TPU Performance jobs in United States
cer-icon
Apply on Employer Site
company-logo

Google · 14 hours ago

Staff Software Engineer, TPU Performance

Google is a leading technology company that develops next-generation technologies for billions of users. The Staff Software Engineer will focus on optimizing Tensor Processing Unit (TPU) fleet efficiency and collaborate with product teams to enhance performance metrics and solutions for large-scale machine learning models.
AppsArtificial Intelligence (AI)Cloud StorageSearch EngineSEO
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Focus on Tensor Processing Unit (TPU) fleet efficiency analysis and performance optimization, while identifying and maintaining Machine Learning (ML) training and serving benchmarks
Use the benchmarks to identify performance opportunities and drive out-of-the-box performance by improving the compiler, runtime, etc. in collaboration with partner teams
Collaborate with Google product teams and researchers to solve performance problems, such as onboarding new Machine Learning models and products onto new Tensor Processing Unit hardware to enable larger models to train efficiently at a very large scale
Analyze performance and efficiency metrics to identify bottlenecks, design, and implement solutions at Google fleet-wide scale
Explore model and data efficiency techniques i.e., model co-design, quantization, and sparsity

Qualification

Software developmentPerformance analysisML performance benchmarkingData structuresAlgorithmsHardware-aware algorithm designTechnical leadershipSoftware designDebuggingVisualization toolsCompiler stacksLarge-scale systemsCross-functional collaboration

Required

Bachelor's degree or equivalent practical experience
8 years of experience in software development
5 years of experience testing, and launching software products
5 years of experience with performance, large-scale systems data analysis, visualization tools, or debugging
3 years of experience with software design and architecture
Experience with ML performance analysis and benchmarking

Preferred

Master's degree or PhD in Engineering, Computer Science, or a related technical field
8 years of experience with data structures and algorithms
3 years of experience in a technical leadership role leading project teams and setting technical direction
3 years of experience working in a matrixed organization involving cross-functional, or cross-business projects
Experience optimizing for NVIDIA/AMD architectures through low-level programming, performance modeling, and bottlenecks analysis to maximize compute efficiency and memory hierarchy utilization
Experience in hardware-aware algorithm design and compiler stacks (e.g., OpenXLA), tailoring large-scale ML models and distributed systems for peak performance across accelerator hardware

Benefits

Bonus
Equity
Benefits

Company

Google specializes in internet-related services and products, including search, advertising, and software. It is a sub-organization of Alphabet.

H1B Sponsorship

Google has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (8763)
2024 (8872)
2023 (9682)
2022 (11626)
2021 (9109)
2020 (9785)

Funding

Current Stage
Public Company
Total Funding
$26.1M
Key Investors
Kleiner Perkins,Sequoia CapitalAndy Bechtolsheim
2004-08-19IPO
1999-06-07Series Unknown· $25M
1998-11-01Angel· $1M

Leadership Team

leader-logo
Sundar Pichai
CEO
linkedin
leader-logo
Thomas Kurian
CEO - Google Cloud
linkedin
Company data provided by crunchbase