SIGN IN
Senior / Staff Data Engineer – Matching & Data Quality jobs in United States
info-icon
This job has closed.
company-logo

Tough Leaf · 15 hours ago

Senior / Staff Data Engineer – Matching & Data Quality

Tough Leaf is a company that helps general contractors and agencies connect with certified small, local, and diverse subcontractors. They are seeking a Senior or Staff-level Data Engineer to manage the full data lifecycle, ensuring high-quality data processing and integrity while leveraging AI tools to enhance efficiency and accuracy.
RetailConstructionSaaSHuman ResourcesB2BProcurementProfessional Networking
Hiring Manager
Hassan Aljanabi
linkedin

Responsibilities

Own ingestion & normalization for messy, multi-source subcontractor data
Build and harden enrichment pipelines (contacts, websites, capabilities), including automated refresh + backfills
Design and ship deduplication & entity resolution systems that prevent match corruption and “ghost firms”
Create data quality gates: validation rules, monitors, alerts, and safe rollouts that reduce manual QA
Improve how data is surfaced for matching & search (ranking signals, relevance, usability)
Productize enrichment so outcomes are repeatable, measurable, and scalable — not one-off magic
Use AI coding agents daily, but hold a high bar for correctness, testing, and review

Qualification

Data ingestionData normalizationDeduplication systemsEntity resolutionData quality gatesAI coding agentsWeb scrapingSearch & matchingProductized enrichmentStartup experience

Required

You've owned production data systems end-to-end (not just built one-off pipelines)
You think deeply about data quality, invariants, and failure modes
You've shipped deduplication, fuzzy matching, entity resolution, or golden-record systems
You're comfortable with schema drift, inconsistent naming, partial truth, and ambiguity
You have strong code review judgment, especially for logic and correctness
You're AI-native: you use coding agents daily — but you verify, test, and refactor ruthlessly

Preferred

Web scraping & crawling experience (especially resilient refresh systems)
Search & matching experience (ranking, relevance, retrieval systems)
Productized enrichment flows (website crawling, LLM cleanup/structuring, map data)
Startup experience where autonomy, speed, and ownership actually matter

Benefits

Competitive salary + meaningful early-stage equity
Health + Dental + Vision coverage
Work with a small, senior, high-candor team
Real ownership, real impact, real production systems
Build things customers rely on — not dashboards nobody reads

Company

Tough Leaf

twittertwittertwitter
company-logo
Tough Leaf is a platform that helps construction companies manage pre-construction processes.

Funding

Current Stage
Early Stage
Total Funding
$7.79M
2024-06-28Series A· $4.5M
2022-07-20Seed· $3.1M
2021-12-03Pre Seed· $0.18M

Leadership Team

leader-logo
Wissam Akra, MBA, PE, DBIA
CEO & Founder
linkedin
leader-logo
Amir Zahlan, P.E.
Co-Founder
linkedin
Company data provided by crunchbase