Johnson & Johnson · 1 day ago
Senior Principal Data Scientist - Cataloging & Metadata
Johnson & Johnson is a leading healthcare innovation company focused on improving health through advanced solutions. They are seeking a Senior Principal Data Scientist to design and implement AI solutions for data cataloging and metadata governance, ensuring high data quality and usability for analytics and decision-making.
Hospital & Health Care
Responsibilities
Own the design, development, and implementation of solutions for the data cataloging, metadata, and governance team
Lead the curation and ongoing management of the enterprise data catalog by capturing, validating, and enriching metadata from diverse sources, ensuring that business terms, data elements, and approved definitions are documented in collaboration with Data Owners and SMEs
Monitor catalog adoption and usage, continuously enhancing catalog usability and searchability so that all critical datasets, data products, and master data entities are indexed, discoverable, and accurately described
Implement rigorous data quality assessments, applying validation and enrichment techniques to maintain the reliability, accuracy, and contextualization of metadata throughout the data lifecycle
Develop and monitor KPIs for metadata quality, completeness, and compliance across domains
Work closely with cross-functional teams, including Knowledge Management, Data Products, and other groups, to integrate catalog automation and metadata capabilities into broader enterprise workflows, supporting seamless data accessibility and governance
Contribute to proof-of-concept/pilot/launch projects that assess data governance and metadata improvements and quantify business value achieved through enhanced data governance
Partner with the DSDH teams to implement automated data governance solutions, monitoring and reporting processes
Participate in ontology and knowledge graph initiatives to ensure metadata is harmonized with enterprise semantic frameworks
Collaborate with JJ Technology, Legal, Compliance, external vendors, and other DSDH teams to ensure alignment and traceability between business definitions, technical metadata, and lineage
Design, develop, and deploy generative AI solutions that are integrated with cataloging workflows, further improving the discoverability, accessibility, and overall effectiveness of the data
Establish and configure integrated connections between multiple cataloging platforms to enable seamless data synchronization and automated metadata updates, reinforcing traceability and discoverability across systems
Qualifications
Required
Master's/PhD in Life Sciences with a master's in Computer Science, Data Science, or Information Systems (or equivalent degree)
7+ years of experience in computational biology, automation, data cataloging (platforms such as TileDB, Collibra, Alation, etc.), business analysis, data science, or related fields, preferably within Life Sciences or a regulated industry
Familiarity with data engineering, automation, data management, data compliance, quality, governance, and AI solutions
6+ years of hands-on experience in Python, SQL, and other AI automation tools
Strong Python skills with API integration and backend development using FastAPI or Flask
Experience with data cataloging platforms and metadata extraction via APIs
Experience with databases (Snowflake, Postgres) and version control (Git)
Hands-on experience building and deploying generative AI solutions
Strong troubleshooting skills across pipelines, APIs and dataflows
Strong stakeholder management skills with the ability to successfully drive solutions independently
Strong people management skills with the ability to mentor and guide resources
Strong communication skills with the ability to work seamlessly across technical and business teams
Strong sense of ownership and accountability in managing critical tasks and responsibilities to ensure successful project outcomes
Preferred
Experience in setting up automations and building intelligent solutions (machine-readable metadata, profiling, validation rules, anomaly detection, etc.)
Excellent attention to detail, data organization, and documentation skills
Familiarity with automated metadata ingestion & catalog curation workflows
Ability to translate complex data concepts into clear, accessible documentation
Benefits
Medical
Dental
Vision
Life insurance
Short- and long-term disability
Business accident insurance
Group legal insurance
Company’s consolidated retirement plan (pension)
Savings plan (401(k))
Company’s long-term incentive program
Vacation – 120 hours per calendar year
Sick time – 40 hours per calendar year; for employees who reside in the State of Washington – 56 hours per calendar year
Holiday pay, including Floating Holidays – 13 days per calendar year
Work, Personal and Family Time – up to 40 hours per calendar year
Parental Leave – 480 hours within one year of the birth/adoption/foster care of a child
Condolence Leave – 30 days for an immediate family member; 5 days for an extended family member
Caregiver Leave – 10 days
Volunteer Leave – 4 days
Military Spouse Time-Off – 80 hours
Company
Johnson & Johnson
At Johnson & Johnson, we believe health is everything.
H1B Sponsorship
Johnson & Johnson has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is presented below for your reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2025 (48)
2024 (56)
2023 (58)
2022 (59)
2021 (44)
2020 (27)
Funding
Current Stage
Late Stage
Company data provided by Crunchbase