Saviance · 2 days ago
Data Pipeline Engineer
Saviance is a healthcare and life sciences consulting partner specializing in data and AI solutions. They are seeking a Data Pipeline Engineer to design, build, and maintain scalable data pipelines that support analytics and AI initiatives, with a focus on healthcare data.
Information Technology & Services
Responsibilities
Design, develop, and maintain end-to-end data pipelines (batch and streaming) for structured and unstructured data
Build robust ETL / ELT workflows to ingest data from multiple sources including APIs, databases, files, and third-party systems
Implement data transformations, validations, and quality checks to ensure accuracy and reliability
Optimize pipeline performance, scalability, and cost efficiency
Work closely with data analysts, BI engineers, data scientists, and product teams to support downstream analytics and AI use cases
Ensure data pipelines comply with security, privacy, and HIPAA requirements where applicable
Monitor pipelines, troubleshoot failures, and implement alerting and recovery mechanisms
Contribute to data architecture decisions, documentation, and best practices
Qualification
Required
5+ years of experience building and supporting data pipelines in production environments
Strong experience with SQL and data modeling concepts
Hands-on experience with ETL/ELT frameworks and orchestration tools
Experience working with cloud platforms (Azure, AWS, or GCP)
Proficiency with data processing tools such as Azure Data Factory, Databricks, Spark, Airflow, or similar
Experience integrating data from APIs, flat files, relational databases, and cloud storage
Strong understanding of data quality, lineage, and pipeline reliability
Excellent problem-solving and communication skill
Preferred
Healthcare domain experience (provider, payer, clinical, claims, PHI data)
Experience with streaming data (Kafka, Event Hub, Kinesis, etc.)
Exposure to Snowflake, BigQuery, Redshift, or other cloud data warehouses
Familiarity with Python or Scala for data processing
Experience supporting BI tools (Power BI, Tableau, Looker)
Knowledge of CI/CD, DevOps, and Infrastructure-as-Code