Johnson & Johnson MedTech · 1 day ago
Principal Data Engineer
Johnson & Johnson MedTech is committed to healthcare innovation, aiming to improve health through advanced solutions. The Principal Data Engineer will design, build, and maintain data infrastructure for analytics, AI, and machine learning, while ensuring data quality and governance across the enterprise.
Hospital & Health Care
Responsibilities
Work closely with data scientists to understand their data needs and enable rapid experimentation through efficient, reusable pipelines
Provide engineering support for feature engineering, data preparation, and production model integration
Help create shared tools, libraries, and templates for AI/ML projects
Design, develop, and maintain ETL/ELT processes and automated data pipelines across structured and unstructured sources
Optimize data workflows for performance, scalability, and cost efficiency in cloud environments
Implement robust data quality monitoring and logging solutions
Build data architecture frameworks that support advanced analytics, AI, and reporting use cases
Collaborate with data governance leads to ensure adoption of updated data policies across all engineering workflows
Integrate data lineage tracking and metadata management into pipelines
Ensure compliance with data privacy laws, security policies, and responsible AI guidelines
Drive internal education and best practice sessions on compliant data handling
Perform other related tasks as assigned by management
Qualification
Required
Master's Degree / PhD + 4 years' experience, OR Bachelor's Degree + 6 years' experience
Educational degree in quantitative field, such as Statistics, Mathematics, Computer Science, Data Science, Engineering, Economics, and/or related quantitative
Advanced proficiency in SQL and Python for data processing, transformation, and automation
Hands-on experience building and optimizing ETL/ELT pipelines using tools such as Apache Airflow, dbt, Spark, or similar
Strong understanding of data modeling, data warehousing, and distributed data processing frameworks
Experience working with cloud-based data platforms (Azure, AWS, or GCP) and managed services (e.g., Snowflake, BigQuery, Databricks)
Proven experience using Dataiku (or similar AI/analytics platforms) for end-to-end data preparation, pipeline orchestration, and production deployment
Ability to embed data governance principles directly into engineering workflows — including data cataloging, lineage, and quality monitoring — ideally within Dataiku
Excellent collaboration and communication skills for working with data scientists, data governance professionals, and business stakeholders
Experience interpreting and communicating analytic results to analytical and non-analytical business partners
Ability to travel both domestically and internationally may be required (~5-10%)
Ability to flex hours to accommodate multiple time zones when necessary
Preferred
Advanced Dataiku expertise, including project design, flow optimization, automation scenarios, API integrations, and platform administration
Experience setting Dataiku usage standards for large practitioner communities, including governance-compliant design patterns and role-based access controls
Background in machine learning operations (MLOps) or supporting AI/ML workloads with engineered features and automated pipelines
Familiarity with DataOps practices for CI/CD, testing, and monitoring of data pipelines
Experience implementing security and privacy controls for sensitive datasets, including HIPAA, GDPR, or similar compliance frameworks
Knowledge of streaming data technologies (Kafka, Kinesis, Spark Streaming)
Certifications in cloud platforms (Azure Data Engineer, AWS Big Data Specialty, Google Professional Data Engineer) or in Dataiku Core Designer / Advanced Designer
Experience working in large matrixed organization, with Global footprint
Benefits
Subject to the terms of their respective plans, employees are eligible to participate in the Company’s consolidated retirement plan (pension) and savings plan (401(k)).
Vacation –120 hours per calendar year
Sick time - 40 hours per calendar year; for employees who reside in the State of Colorado –48 hours per calendar year; for employees who reside in the State of Washington –56 hours per calendar year
Holiday pay, including Floating Holidays –13 days per calendar year
Work, Personal and Family Time - up to 40 hours per calendar year
Parental Leave – 480 hours within one year of the birth/adoption/foster care of a child
Bereavement Leave – 240 hours for an immediate family member: 40 hours for an extended family member per calendar year
Caregiver Leave – 80 hours in a 52-week rolling period10 days
Volunteer Leave – 32 hours per calendar year
Military Spouse Time-Off – 80 hours per calendar year
Company
Johnson & Johnson MedTech
At Johnson & Johnson MedTech, we are working to solve the world’s most pressing healthcare challenges through innovations at the intersection of biology and technology.