Google · 5 hours ago
Senior Staff Software Engineer, Data Quality and ML Infrastructure
Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. As a Senior Staff Software Engineer, you will design and lead the implementation of a data infrastructure strategy while partnering with applied science and Machine Learning research leads to enable state-of-the-art model training.
AppsArtificial Intelligence (AI)Cloud StorageSearch EngineSEO
Responsibilities
Design and lead the implementation of a data infrastructure strategy capable of ingesting and processing petabytes of user data, optimizing for high-throughput Tensor Processing Unit (TPU) utilization and balancing storage costs with global availability
Partner with applied science and Machine Learning (ML) research leads to translating foundation model requirements into scalable, production-ready infrastructure that enables state-of-the-art model training
Build automated frameworks for schema enforcement, anomaly detection, and semantic drift monitoring to ensure data integrity across massive user datasets
Define and own strict service level objectives for data freshness and completeness, implementing defensive engineering patterns to shield downstream ML jobs from upstream corruption
Act as the primary technical lead while managing a small, agile group of executive engineers, overseeing the tech stack selection, code quality, and the roadmap for high-impact data reprocessing
Qualification
Required
Bachelor's degree or equivalent practical experience
8 years of experience in software development
7 years of experience leading technical project strategy, ML design, and working with ML infrastructure (e.g., model deployment, model evaluation, data processing, debugging, fine tuning)
5 years of experience with design and architecture; and testing/launching software products
Experience with distributed storage and processing frameworks (e.g., Spark, Flink, Beam, Presto/Trino, Hadoop) and designing systems that handle petabyte-scale datasets with strict availability and latency requirements
Preferred
Master's degree or PhD in Engineering, Computer Science, or a related technical field
8 years of experience with data structures and algorithms
5 years of experience in a technical leadership role leading project teams and setting technical direction
3 years of experience working in a complex, matrixed organization involving cross-functional, or cross-business projects
Experience building data infrastructure for Large Language Models (LLMs) or Foundation Models. Understanding of pre-training vs. fine-tuning data requirements, tokenization at scale, and sequence modeling data structures
Understanding of modern table formats (Apache Iceberg, Hudi, Delta Lake) and columnar storage (Parquet, Avro, ORC)
Benefits
Bonus
Equity
Benefits
Company
Google specializes in internet-related services and products, including search, advertising, and software. It is a sub-organization of Alphabet.
H1B Sponsorship
Google has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (8763)
2024 (8872)
2023 (9682)
2022 (11626)
2021 (9109)
2020 (9785)
Funding
Current Stage
Public CompanyTotal Funding
$26.1MKey Investors
Andy Bechtolsheim
2004-08-19IPO
1999-06-07Series Unknown· $25M
1998-11-01Angel· $1M
Recent News
2026-01-22
2026-01-22
2026-01-22
Company data provided by crunchbase