Lead Data Engineer - Clinical AI jobs in United States

Qualified Health · 5 hours ago

Lead Data Engineer - Clinical AI

Qualified Health is a company focused on transforming healthcare through Generative AI. They are seeking a Lead Data Engineer to design and build data transformation pipelines that convert raw clinical data into AI-ready features, enabling faster and more accurate clinical insights.

Artificial Intelligence (AI) · Health Care · Medical
H1B Sponsor Likely

Responsibilities

Design and build clinical annotation pipelines that extract conditions, medications, and procedures from unstructured clinical notes
Implement negation and temporal detection to distinguish current conditions from historical findings (critical for clinical decision-making)
Build business rules engines that classify medications, calculate risk scores, and apply clinical logic at scale
Integrate clinical reference data (drug databases, terminology mappings) into transformation pipelines
Optimize data structures to reduce LLM processing time and improve downstream AI performance
Build production-grade pipelines using PySpark and Databricks for large-scale clinical data processing
Implement data quality frameworks to validate clinical transformations and catch issues before they reach AI workflows
Design feature stores that serve pre-computed clinical features to ML models and LLM applications
Maintain pipeline observability with monitoring, alerting, and performance tracking
Partner with clinical SMEs to translate medical knowledge into data transformation logic
Define data contracts with the AI team to ensure feature outputs meet LLM workflow requirements
Contribute to technical standards and best practices for clinical data engineering

Qualifications

Data engineering · Databricks · Clinical data transformation · Feature engineering · Healthcare data experience · Clinical text processing · Data quality mindset · Healthcare terminology · Azure cloud platform · Clinical NLP tools · RAG architecture patterns · Collaboration

Required

8+ years of data engineering experience, with demonstrated expertise building production data pipelines
5+ years on Databricks, including PySpark, Delta Lake, and Unity Catalog
Healthcare data experience: Prior work with FHIR APIs, EHR databases, or claims data
Clinical text processing experience: Built pipelines that extract entities from unstructured clinical notes using tools like spaCy, medspaCy, or cloud NLP services
Feature engineering for ML/AI: Experience preparing data for machine learning models or LLM consumption
Data quality mindset: Track record of implementing validation frameworks and monitoring for data pipelines
Healthcare terminology: Familiarity with ICD-10, RxNorm, SNOMED CT, LOINC
Epic Clarity experience: Direct work with Epic's relational database structure

Preferred

Azure cloud platform: Hands-on with Azure Databricks, Data Lake Storage, Service Bus
Clinical NLP tools: Experience with Azure Text Analytics for Health, Amazon Comprehend Medical, or similar
RAG architecture patterns: Understanding of vector databases and retrieval-augmented generation

Benefits

Robust medical/dental/vision insurance
Flexible working hours
Hybrid work options
Equity packages

Company

Qualified Health

Qualified Health integrates generative AI into a healthcare platform that offers continuous monitoring of algorithm performance.

H1B Sponsorship

Qualified Health has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
[Chart: Distribution of Different Job Fields Receiving Sponsorship]
Trends of Total Sponsorships: 2025 (1)

Funding

Current Stage: Growth Stage
Total Funding: $30M
2025-01-08 · Seed · $30M
2024-05-24 · Pre Seed

Leadership Team

Justin Norden, Co-Founder and CEO
Beau Norgeot, Co-Founder and Chief AI Officer
Company data provided by Crunchbase