Hippocratic AI · 13 hours ago
Senior Data Engineer
Hippocratic AI is the leading generative AI company in healthcare, focused on transforming patient outcomes with a safety-first approach. The Senior Data Engineer will design and operate data systems that ensure the reliability and compliance of AI deployments in healthcare environments, working closely with ML, AI, and product teams.
Artificial Intelligence (AI)Foundational AIGenerative AIHealth CareInformation Technology
Responsibilities
Build & operate data platforms and pipelines (batch/stream) that feed training, RAG, evaluation, and analytics using tools like Prefect, dbt, Airflow, Spark, and cloud data warehouses (Snowflake/BigQuery/Redshift)
Own data governance and access control: implement HIPAA-grade permissioning, lineage, audit logging, and DLP; manage IAM, roles, and policy-as-code
Ensure reliability, observability, and cost efficiency across storage (S3/GCS), warehouses, and ETL/ELT—SLAs/SLOs, data quality checks, monitoring, and disaster recovery
Enable self-service analytics via curated models and semantic layers; mentor engineers on best practices in schema design, SQL performance, and data lifecycle. Partner with ML/Research to provision high-quality datasets, feature stores, and labeling/eval corpora with reproducibility (versioning, metadata, data contracts)
Qualification
Required
5+ years of software or data engineering experience, with 3+ years building data infrastructure, ETL/ELT pipelines, or distributed data systems
Deep experience with Python and at least one cloud data platform (Snowflake, DataBricks, BigQuery, Redshift, or equivalent)
Familiarity with orchestration tools (Airflow, prefect, dbt) and infrastructure-as-code (Terraform, CloudFormation)
Strong understanding of data security, access control, and compliance frameworks (HIPAA, SOC 2, GDPR, or similar)
Proficiency with SQL and experience optimizing query performance and storage design
Excellent problem-solving and collaboration skills — able to work across engineering, ML, and clinical teams
Comfortable navigating trade-offs between performance, cost, and maintainability in complex systems
Preferred
Experience supporting ML pipelines, feature stores, or model training datasets
Familiarity with real-time streaming systems (Kafka, Kinesis) or large-scale unstructured data storage (S3, GCS)
Background in data reliability engineering, data quality monitoring, or governance automation
Experience in healthcare, safety-critical systems, or regulated environments
Company
Hippocratic AI
Hippocratic AI is a healthcare technology company that develops safety-focused large-language models for medical applications.
H1B Sponsorship
Hippocratic AI has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (9)
2024 (1)
Funding
Current Stage
Growth StageTotal Funding
$402MKey Investors
AvenirKleiner PerkinsNVentures
2025-11-03Series C· $126M
2025-01-09Series B· $141M
2024-09-19Series A· $17M
Recent News
2026-01-11
Company data provided by crunchbase