Senior Data Quality Engineer (I, II, III) jobs in United States
cer-icon
Apply on Employer Site
company-logo

Cellares · 2 weeks ago

Senior Data Quality Engineer (I, II, III)

Cellares is an innovative company focused on advanced cell therapy manufacturing. They are seeking a Senior Data Quality Engineer to ensure the accuracy, reliability, and integrity of data within their data platform, participating in cross-functional teams and maintaining automated testing frameworks.

BiotechnologyLife ScienceManufacturingMedicalTherapeutics
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Build and maintain automated data validation tests using Databricks notebooks and tools like Pytest
Test data ingestion, transformation, and loading processes within the Databricks Lakehouse, specifically focusing on the Bronze, Silver, and Gold layers of the Medallion architecture
Implement tests for data accuracy, completeness, consistency, timeliness, and uniqueness at different points in the pipeline to catch data issues early
Reconcile data by comparing record counts, schemas, and values between source systems and target tables in Databricks
Implement automated data quality checks within data pipelines to ensure no data regressions occur with new code deployments
Implement automated monitoring and alerting for data quality metrics, identifying anomalies in data freshness, schema evolution, and volume
Work closely with data engineers and product owners to understand data requirements and ensure data quality meets business needs
Ensure compliance with data governance policies by building quality checks that validate data sensitivity, masking, and lineage, leveraging tools like Unity Catalog
Communicate project status and new discoveries in a clear and timely manner during daily stand-ups

Qualification

DatabricksPythonSQLData quality testingData warehousingAzureAnalytical skillsProblem-solvingTeam playerSelf-awarenessIntegrityGrowth mindset

Required

Bachelor's or Master's in Computer Science, Electrical Engineering, or related field and 5+ years of relevant experience
Experience with data pipeline and data quality testing strategy and execution, with significant hands-on experience in the Databricks environment
Strong proficiency in Python for developing and executing data validation scripts
In-depth knowledge of Databricks, Delta Lake, and the Lakehouse architecture
Proficiency in writing complex SQL queries for data validation, reconciliation, and troubleshooting issues
Solid understanding of data warehousing concepts, including dimensional modeling (star/snowflake schemas)
Hands-on experience with Azure, including Azure storage and data services that integrate with Databricks
Ability to process data, interpret testing results and provide feedback to the team
Desire to be part of a rapidly evolving organization, with compelling technology, and taking products and processes to the next level
Self-awareness, integrity, authenticity, and a growth/entrepreneurial mindset

Benefits

Highly subsidized Medical, Dental, and Vision Plans
401(k) Matching
Free EV Charging
Onsite lunches
Stock options

Company

Cellares

twittertwitter
company-logo
Cellares is a life sciences technology company that develops the Cell Shuttle to automate cell therapy manufacturing.

H1B Sponsorship

Cellares has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)
2024 (2)
2023 (2)
2022 (1)
2020 (1)

Funding

Current Stage
Growth Stage
Total Funding
$355M
Key Investors
Koch Disruptive TechnologiesEclipse Ventures
2023-08-23Series C· $255M
2021-05-05Series B· $82M
2020-10-29Series A· $18M

Leadership Team

leader-logo
Fabian Gerlinghaus
Co-Founder & CEO
linkedin
leader-logo
Omar Kurdi
Co-Founder and President
linkedin
Company data provided by crunchbase