Staff Software Engineer, Emerging On-prem AI Infrastructure jobs in United States

Google · 8 hours ago

Staff Software Engineer, Emerging On-prem AI Infrastructure

Google is seeking a Staff Software Engineer to develop next-generation technologies that change how billions of users connect and interact with information. In this role, you will work on integrated AI infrastructure systems, building large AI clusters and optimizing performance for Google's services and Cloud offerings.

Apps, Artificial Intelligence (AI), Cloud Storage, Search Engine, SEO
Growth Opportunities
H1B Sponsor Likely

Responsibilities

Drive project success by setting the technical goal and roadmap
Set priorities and projects for a team that delivers features in a fast-moving environment for both internal customers (other engineering teams) and external customers
Take central responsibility for diagnosing and troubleshooting end-to-end supportability issues, uncovering and addressing complex technical problems, and building repair automation systems
Implement and govern the success metrics for the team, spanning Operational Plane metrics (e.g., support case metrics, GSO case handling) and RMA/Spares metrics (e.g., swap and repair rate)

Qualifications

C++, Software design, Large-scale infrastructure, Cloud infrastructure, Diagnostics, Troubleshooting, Low-level system software, Distributed systems, Networking, Problem-solving, Adaptability

Required

Bachelor's degree or equivalent practical experience
8 years of experience programming in C++
5 years of experience testing and launching software products
5 years of experience building and developing large-scale infrastructure, distributed systems or networks, or experience with compute technologies, storage, or hardware architecture
3 years of experience with software design and architecture

Preferred

Experience building cloud or systems level infrastructure spanning the entire hardware and software stack
Experience in end-to-end diagnostics, troubleshooting, and supportability, including leading SWAT team efforts for complex issues and developing long-term, sustainable solutions
Familiarity with Service Level Objectives (SLOs)/metrics measurement, logs/telemetry/metrics integration with tools for enhanced operator experience
Understanding of low-level system software, OS, firmware, low-level networking, or hardware, with a passion for building system skills
Ability to work in a changing environment and navigate ambiguity, and a track record of delivering solutions for subtle or complex technical problems

Benefits

Health, dental, vision, life, disability insurance
Retirement Benefits: 401(k) with company match
Paid Time Off: 20 days of vacation per year, accruing at a rate of 6.15 hours per pay period for the first five years of employment
Sick Time: 40 hours/year (statutory, where applicable); 5 days/event (discretionary)
Maternity Leave (Short-Term Disability + Baby Bonding): 28-30 weeks
Baby Bonding Leave: 18 weeks
Holidays: 13 paid days per year

Company

Google specializes in internet-related services and products, including search, advertising, and software. It is a subsidiary of Alphabet.

H1B Sponsorship

Google has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for your reference. (Data powered by the US Department of Labor)
Chart: Distribution of Different Job Fields Receiving Sponsorship
Trends of Total Sponsorships
2025 (8763)
2024 (8872)
2023 (9682)
2022 (11626)
2021 (9109)
2020 (9785)

Funding

Current Stage
Public Company
Total Funding
$26.1M
Key Investors
Andy Bechtolsheim
2004-08-19 · IPO
1999-06-07 · Series Unknown · $25M
1998-11-01 · Angel · $1M

Leadership Team

Sundar Pichai · CEO
Thomas Kurian · CEO - Google Cloud
Company data provided by Crunchbase