Xealth · 2 hours ago
Senior Data Engineer
Xealth is revolutionizing healthcare by leveraging data and automation to empower care providers. The role involves designing, building, and scaling services for Xealth’s Analytics and Reporting Capabilities, focusing on data processing pipelines and analytics products.
AnalyticsData IntegrationHealth CareInformation TechnologyTherapeutics
Responsibilities
Data Modeling: Execute expert-level Data Modeling and Design, utilizing dimensional modeling and denormalization techniques specifically for analytic workloads
Data Ingestion: Ability to consume and process high-volume bounded and unbounded data, build robust Change Data Capture (CDC) mechanisms, and gather data from API calls and webhooks
Pipeline Design & Orchestration: Design, build, and optimize high-volume, real-time Streaming Data Pipelines utilizing PySpark and Databricks environments
Scalability & Maintenance: Maintain and scale large Data Lake Pipelines, ensuring high performance and cost-efficiency
Unit testing & Quality Assurance: Write comprehensive unit and integration tests for data pipelines to ensure code quality and production reliability
Cross-Functional Collaboration: Partner with product managers and EHR specialists to translate clinical user behaviors into rich, analytical datasets, unlocking critical insights that drive evidence-based improvements in healthcare processes
Technical Leadership: Contribute to code reviews, system design discussions, and technical decisions that raise the engineering bar across the team
Automation and AI in Development: Use AI-assisted coding tools like GitHub Copilot to streamline development, increase quality, and accelerate delivery
Qualification
Required
5+ years of professional experience building production-grade data pipelines and applications
Expert proficiency in Python, PySpark and SQL
Solid hands-on experience working with modern massively parallel data processing systems
Deep understanding of algorithms and data structures, with a specific focus on distributed computing principles (concurrency, partitioning, shuffling)
Proficient in diagnosing complex failures in distributed processing jobs (e.g., Spark executor errors, memory leaks, data skew) using logs, distributed tracing, and performance metrics
Deep practical knowledge of open table formats, such as Delta Lake
Proficiency with common big data file formats, including Apache Parquet and Apache Avro
Experience implementing Infrastructure as Code (IaC) principles and tools for the automated deployment and management of data pipelines
Hands-on experience designing robust data ingestion frameworks via RESTful APIs
Experience building event-driven architectures for real-time data flow
Experience designing and scaling cloud-native data platforms
Experience orchestrating data workloads using AWS and Kubernetes
Preferred
Prior experience in regulated industry with high security requirements
Good working understanding of Data Security principles, particularly regarding Protected Health Information (PHI) and sensitive data governance
Expertise building streaming data pipelines, leveraging stream processors such as Apache Kafka and Apache Flink
Experience implementing and utilizing Data Observability tools and practices to monitor data quality, lineage, and pipeline health
Experience building dashboards and visualizations to communicate data insights effectively
Benefits
Paid parental leave.
Comprehensive medical, dental, and vision policies. Xealth covers 100% of employee premiums.
Employee Assistance Programs.
Xealth provides your laptop and offers a home office stipend.
Generous learning & development opportunities for you to grow your skills and career.
401k Match: Xealth offers a dollar-for-dollar match up to 3%.
Flexible time off & 10 standardized holidays.
$500 yearly fitness stipend to spend on staying active.
Company
Xealth
Xealth is a digital health platform that enables healthcare organizations to prescribe and monitor health tools, programs, and services.
Funding
Current Stage
Growth StageTotal Funding
$56.94MKey Investors
MorningsideAdvocate Aurora EnterprisesCerner Capital,LRVHealth
2025-11-20Acquired
2025-03-25Series Unknown
2024-02-27Series Unknown· $3.44M
Recent News
2026-01-07
2025-10-17
Fierce Healthcare
2025-08-06
Company data provided by crunchbase