Arine · 1 day ago
Staff Data Engineer
Arine is a rapidly growing healthcare technology and clinical services company focused on improving patient care through innovative software solutions. The Staff Data Engineer will lead the design and development of scalable data ingestion pipelines, utilizing expertise in Python and AWS to enhance data management across the organization.
Artificial Intelligence (AI)Health CareMedicalPharmaceutical
Responsibilities
Act as the team architect by leading system design reviews, offering recommendations, conducting comprehensive peer reviews, and demonstrating expert-level proficiency in Python and AWS services
Architect and implement scalable data ingestion pipelines that handle different file types into the Arine platform
Develop reusable components that integrate into data pipelines to increase efficiency and reduce future implementation time
Create configuration-driven, containerized toolsets that are easy to use and maintain across diverse engineering profiles
Work collaboratively with cross-functional teams to meet data requirements through ETL components
Design and maintain data transformation pipelines using DBT, including macros, incremental models, and DBT tests
Implement incremental data ingestion strategies for large-scale healthcare datasets
Build monitoring and alerting systems for data ingestion processes and overall pipeline health
Apply software engineering best practices, including test-driven development and modular design, to data infrastructure
Refactor and rebuild existing data ingestion processes to improve scalability and operational efficiency
Work with containerization technologies (Docker, Kubernetes) to create portable and maintainable data solutions
Identify and escalate inefficiencies within and across teams
Provide technical guidance and mentorship to junior engineers, and promote best practices and coding standards
Author and maintain high-quality technical documentation, and support junior engineers in doing the same
Collaborate with the DE Manager to report on DE contractor performance issues
Qualification
Required
10+ years working in data engineering, with a focus on large-scale data ingestion and infrastructure
Deep expertise in Python and modern data engineering tools
A track record of building automated, production-grade ETL processes using Python and dbt SQL
Strong understanding of ETL/ELT frameworks and distributed data processing
Hands-on proficiency with modern data technologies and comfort leveraging AI coding assistants to accelerate development, improve code quality, and enhance productivity
Skilled in data processing, validation, cleaning, and debugging
Strong capability integrating APIs for seamless data exchange between systems
Proven ability to handle and process varied file types and formats, including healthcare standards such as HL7, 834, 837, and NCPDP
Demonstrated success integrating and consolidating data from diverse source systems into a unified repository, including EHR and claims systems, via both file-based and API integrations
Comfort working with large-scale datasets (10GB+)
Strong capability implementing incremental processing and change data capture (CDC) methodologies
Extensive background designing scalable data architectures in AWS environments
Solid grounding in software engineering principles, including test-driven development, loose coupling, single responsibility, and modular design
Hands-on familiarity with containerization (Docker, Kubernetes) and building configuration-driven, maintainable systems
Proven ability to build tools and systems that diverse engineering profiles can operate through configuration rather than code changes
A passion for building new data infrastructure and continuously improving existing systems with robustness, maintainability, and operational excellence
Strong collaboration skills, with comfort partnering across technical and non-technical stakeholders
Excellent written and verbal communication, with the ability to explain technical infrastructure concepts to diverse audiences
Ability to pass a background check
Must live in and be eligible to work in the United States
Preferred
Familiarity with healthcare data and regulatory environments (HIPAA) as a plus
Company
Arine
Arine uses AI to optimize medication therapy, improving patient outcomes and reducing healthcare costs for payers and providers.
H1B Sponsorship
Arine has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (5)
Funding
Current Stage
Growth StageTotal Funding
$70MKey Investors
Town Hall Ventures111° West Capital
2025-05-16Series C· $30M
2022-08-17Series B· $29M
2022-08-17Debt Financing
Leadership Team
Recent News
2025-07-07
Company data provided by crunchbase