Principal Data Engineer jobs in United States

SoTalent · 17 hours ago

Principal Data Engineer

SoTalent is seeking a Principal Data Engineer to lead and support complex data engineering initiatives across Research & Development. This role involves designing, developing, and maintaining data pipelines for diverse data sources, ensuring compliance and quality, and collaborating with various stakeholders.

Staffing & Recruiting

Responsibilities

Design, build, and maintain data pipelines for structured and unstructured data using AWS services, Python, R, and SQL
Create and optimize ETL/ELT processes for diverse healthcare and research data
Develop and manage data repositories (AWS S3, FSx) and data warehousing solutions (Amazon Redshift)
Build and maintain standard data models to support analytics and reporting
Implement data quality frameworks, validation processes, and KPIs
Ensure data versioning, lineage tracking, and compliance with regulatory requirements (HIPAA, GDPR)
Document architectures, workflows, and processes to ensure transparency and reproducibility
Apply best practices in software development (CI/CD, DevOps, code versioning)
Collaborate with researchers, data scientists, and stakeholders to deliver tailored solutions
Support or deliver data literacy training across R&D teams

Qualifications

Data engineering, AWS, Python, SQL, R, Data modeling, Healthcare data standards, Analytical skills, Agile development, MLOps, Docker, Kubernetes, Big data technologies, Problem-solving, Communication skills

Required

Bachelor's degree in Computer Science, Statistics, Mathematics, Life Sciences, or related field (Master's preferred)
3–5 years of experience in data engineering, including at least 1.5 years with healthcare, research, or clinical data
Strong skills in Python, R, SQL, and AWS (S3, Redshift, FSx, Glue, Lambda)
Experience with relational databases, data modeling, and database design
Familiarity with NoSQL, graph databases, Docker, Kubernetes, and big data technologies
Exposure to healthcare data standards (CDISC, HL7, FHIR, SNOMED CT, OMOP, DICOM)
Knowledge of MLOps and deploying machine learning models is a plus
Strong problem-solving, analytical, and communication skills
Experience in Agile development environments

Benefits

Discretionary bonus and long-term equity incentive eligibility.
Comprehensive benefits including medical, dental, vision, 401(k), and flexible paid time off.

Company

SoTalent

At SoTechTalent, we specialise in connecting forward-thinking tech companies with world-class talent.

Funding

Current Stage
Early Stage
Company data provided by Crunchbase