Staff Software Engineer, Emerging On-prem AI Infrastructure jobs in United States
cer-icon
Apply on Employer Site
company-logo

Google · 23 hours ago

Staff Software Engineer, Emerging On-prem AI Infrastructure

Google is a leading technology company that develops next-generation technologies impacting billions of users. In this role, you will work on integrated AI infrastructure systems, optimizing performance and building large AI clusters using the latest technologies for AI acceleration and cluster interconnects and networking.

AppsArtificial Intelligence (AI)Cloud StorageSearch EngineSEO
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Drive project success by setting the technical goal and roadmap
Set priorities and projects for a team that delivers features in a fast-moving environment for both internal customers (other engineering teams) and external customers
Ensure central responsibility is taken for diagnostics and troubleshooting of end-to-end supportability issues, to uncover and address complex technical problems, and the building of repair automation systems
Implement and govern the success metrics for the team, spanning Operational Plane metrics (e.g., Support case metrics, GSO case handling), and RMA/Spares metrics (e.g., swap and repair rate)

Qualification

C++Large-scale infrastructureSoftware designArchitectureCloud infrastructureDiagnosticsTroubleshootingLow-level system softwareProblem-solvingAdaptabilityLeadership

Required

Bachelor's degree or equivalent practical experience
8 years of experience programming in C++
5 years of experience testing, and launching software products
5 years of experience building and developing large-scale infrastructure, distributed systems or networks, or experience with compute technologies, storage, or hardware architecture
3 years of experience with software design and architecture

Preferred

Experience building cloud or systems level infrastructure spanning the entire hardware and software stack
Experience in end-to-end diagnostics, troubleshooting, and supportability, with experience leading SWAT team efforts for complex issues and developing long term sustainable solutions
Familiarity with Service Level Objectives (SLOs)/metrics measurement, logs/telemetry/metrics integration with tools for enhanced operator experience
Understanding of low-level system software, OS, firmware, low level networking, or hardware, etc., with a passion for building system skills
Ability to work in a changing environment and navigate ambiguity, and a track record of delivering solutions for subtle or complex technical problems

Benefits

Bonus
Equity
Benefits

Company

Google specializes in internet-related services and products, including search, advertising, and software. It is a sub-organization of Alphabet.

H1B Sponsorship

Google has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (8763)
2024 (8872)
2023 (9682)
2022 (11626)
2021 (9109)
2020 (9785)

Funding

Current Stage
Public Company
Total Funding
$26.1M
Key Investors
Andy Bechtolsheim
2004-08-19IPO
1999-06-07Series Unknown· $25M
1998-11-01Angel· $1M

Leadership Team

leader-logo
Sundar Pichai
CEO
linkedin
leader-logo
Thomas Kurian
CEO - Google Cloud
linkedin
Company data provided by crunchbase