Cellares · 2 months ago
Senior Data Quality Engineer (I, II, III)
Cellares is seeking an innovative and highly motivated Senior Data Quality Engineer who will contribute to the development of our advanced cell therapy manufacturing platform. The primary focus of this position is to ensure the accuracy, reliability, and integrity of data within our data platform, participating on a cross-functional team to design, build, and maintain automated testing frameworks to ensure data integrity at every stage of our data pipelines.
BiotechnologyLife ScienceManufacturingMedicalTherapeutics
Responsibilities
Build and maintain automated data validation tests using Databricks notebooks and tools like Pytest
Test data ingestion, transformation, and loading processes within the Databricks Lakehouse, specifically focusing on the Bronze, Silver, and Gold layers of the Medallion architecture
Implement tests for data accuracy, completeness, consistency, timeliness, and uniqueness at different points in the pipeline to catch data issues early
Reconcile data by comparing record counts, schemas, and values between source systems and target tables in Databricks
Implement automated data quality checks within data pipelines to ensure no data regressions occur with new code deployments
Implement automated monitoring and alerting for data quality metrics, identifying anomalies in data freshness, schema evolution, and volume
Work closely with data engineers and product owners to understand data requirements and ensure data quality meets business needs
Ensure compliance with data governance policies by building quality checks that validate data sensitivity, masking, and lineage, leveraging tools like Unity Catalog
Communicate project status and new discoveries in a clear and timely manner during daily stand-ups
Qualification
Required
Bachelor's or Master's in Computer Science, Electrical Engineering, or related field and 5+ years of relevant experience
Experience with data pipeline and data quality testing strategy and execution, with significant hands-on experience in the Databricks environment
Strong proficiency in Python for developing and executing data validation scripts
In-depth knowledge of Databricks, Delta Lake, and the Lakehouse architecture. Proficiency in writing complex SQL queries for data validation, reconciliation, and troubleshooting issues
Solid understanding of data warehousing concepts, including dimensional modeling (star/snowflake schemas)
Hands-on experience with Azure, including Azure storage and data services that integrate with Databricks
Ability to process data, interpret testing results and provide feedback to the team
Desire to be part of a rapidly evolving organization, with compelling technology, and taking products and processes to the next level
Self-awareness, integrity, authenticity, and a growth/entrepreneurial mindset
Benefits
Highly subsidized Medical, Dental, and Vision Plans
401(k) Matching
Free EV Charging
Onsite lunches
Stock options
Company
Cellares
Cellares is a life sciences technology company that develops the Cell Shuttle to automate cell therapy manufacturing.
H1B Sponsorship
Cellares has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)
2024 (2)
2023 (2)
2022 (1)
2020 (1)
Funding
Current Stage
Growth StageTotal Funding
$355MKey Investors
Koch Disruptive TechnologiesEclipse Ventures
2023-08-23Series C· $255M
2021-05-05Series B· $82M
2020-10-29Series A· $18M
Recent News
Company data provided by crunchbase