innoVet Health (SDVOSB) · 1 day ago
Senior Data Engineer
InnoVet Health is a small and growing business that provides health IT professional services to the Department of Veterans Affairs. They are seeking a Senior Data Engineer to build scalable data pipelines and support healthcare analytics, interoperability, and data integration needs, directly impacting VA healthcare delivery.
Health CareHospital
Responsibilities
Gather and translate business, technical, and functional requirements into data architecture and pipeline design decisions. Design and develop Azure Data Factory and Databricks-based ETL/ELT pipelines using PySpark, Delta Lake, and medallion/lakehouse architecture
Ingest and transform healthcare data (clinical, claims, FHIR, HL7, EHR, ADT, PGHD) from diverse sources
Build secure, scalable solutions using Azure Data Lake Storage, ADF, and Event Hubs, and related services, with attention to latency and reliability requirements
Implement data quality, lineage, and governance using Microsoft Purview
Optimize Databricks jobs (performance tuning, cluster sizing, Z-ordering, partitioning)
Enforce HIPAA-aligned security practices: RBAC, Key Vault, private endpoints, PHI protection
Collaborate with data scientists, analysts, and clinical informatics teams
Stay up to date with emerging technologies and trends in data engineering and healthcare data management
Present and discuss results with IT and business stakeholders
Participate in company growth and other responsibilities, as assigned
Qualification
Required
Bachelor's or master's degree in computer science, data analytics, or related field
Minimum 6+ years data engineering experience; 4+ years hands-on with Azure and 2+ years hands-on with Databricks
Strong skills in PySpark, Delta Lake, SQL, and distributed data processing
Experience with healthcare data standards (FHIR, HL7, X12/EDI, CCD, claims data, PGHD)
Strong understanding of HIPAA, PHI handling, and secure data architecture
Experience with ADF, ADLS Gen2, Azure Functions, and event-driven ingestion
Strong understanding of data modeling for analytics (dimensional + lakehouse)
Excellent problem-solving, collaboration and communication skills
Green card or US citizen required because of government contract work
No 1099 or corp-to-corp or international outsourcing or staffing agencies
Preferred
Experience with Federal EHR (VistA and Oracle Health) data
Experience with Azure Event Hubs, Stream Analytics, AWS Kinesis, or similar data streaming platforms is also a plus