Senior Data Engineer – Healthcare Data & AI Systems jobs in United States
cer-icon
Apply on Employer Site
company-logo

SideBy Care · 5 months ago

Senior Data Engineer – Healthcare Data & AI Systems

SideBy Care is the first AI-powered virtual care service for GI practices and their patients with Disorders of Gut-Brain Interaction. They are seeking a Senior Data Engineer to build and manage the data backbone of their platform, focusing on complex data systems that power analytics and clinical decision-making in healthcare.

Health CareTelehealthVirtual Assistant

Responsibilities

Architect and implement robust data pipelines between EMRs, internal systems, and Snowflake, ensuring scalability, reliability, and data provenance
Lead the design of warehouse schemas for multiple use cases: transactional processing, reporting (BI), and statistical/ML analysis
Define and enforce standards for data semantics, integrity, quality, lineage, and access control
Collaborate with data scientists and ML engineers to enable production-grade ML workflows (e.g., TensorFlow pipelines, model monitoring, A/B testing infrastructure)
Experiment with and support the deployment of LLMs to enable reasoning, summarization, and classification on structured and unstructured data (e.g., clinical notes)
Build monitoring and alerting around pipeline health and data trustworthiness
Integrate and normalize complex healthcare data sources (FHIR/HL7, custom APIs, third-party vendors) into a unified analytics model
Partner with engineering and product teams to deliver data-driven features, dashboards, and insights

Qualification

Data engineeringPythonSnowflakeHealthcare data integrationMachine learningData warehouse designData privacy practicesAWS servicesSoft skills

Required

5+ years of experience in data engineering or backend systems, with senior or staff-level contributions
Deep Python proficiency, with production experience in ETL, data validation, and orchestration frameworks (e.g., Airflow, Dagster, dbt)
Strong experience with data warehouse design, including star/snowflake schemas, denormalization strategies, and performance optimization
Strong understanding of data privacy and security practices, especially in healthcare (HIPAA, de-identification, audit logging, etc.)
Proven experience managing complex integrations with EMRs or clinical systems
Familiarity with LLM and ML development tools (e.g., TensorFlow, PyTorch, LangChain, transformers, vector DBs)
Experience deploying or supporting predictive models in production environments
Expertise in Snowflake or similar cloud data platforms (e.g., BigQuery, Redshift)
Strong grasp of data modeling, provenance, and semantics for analytical and AI purposes
Experience working with AWS services such as S3, Lambda, Batch, Event Bridge, Cloud Front, EC2, etc

Preferred

Experience working with graph-based reasoning engines or healthcare ontologies
Knowledge of analytics frameworks like Superset or Looker
Familiarity with HL7, FHIR, or other clinical interoperability standards
Exposure to real-time or streaming data systems (Kafka, Pulsar)

Benefits

Competitive pay
Flexible remote work culture

Company

SideBy Care

twittertwitter
company-logo
SideBy Care is a healthcare platform that specializes in virtual care services for gut health.

Funding

Current Stage
Early Stage
Company data provided by crunchbase