GSK · 4 hours ago
Director, Head of Research Data Integration & Analytics
GSK is a global biopharma company with a purpose to unite science, technology and talent to get ahead of disease together. The Director, Head of Research Data Integration & Analytics leads the strategic development and implementation of robust data analytical systems and advanced AI/ML solutions, ensuring high-quality data and actionable insights for infectious disease research.
BiotechnologyHealth CarePharmaceutical
Responsibilities
Provides strategic vision and leadership for research data integration and advanced analytics initiatives within VIDRU Data Sciences, with a focus on accelerating infectious disease research through FAIR data and AI/ML and transforming raw experimental outputs into analysis-ready data. This includes promoting the adoption of cutting-edge AI methodologies into daily operations and refining high-impact use cases
Leads and manages a multidisciplinary team of data scientists, data architects, and scientific software/research engineers, fostering a culture of high performance, scientific innovation, and continuous professional development
Directs the design, development, and implementation of robust, scalable integrated data systems and automated, product-grade data processing and integration pipelines (e.g., for cloud computing) to consolidate and harmonize diverse bio-clinical datasets, including multi-omics, preclinical, translational, and early clinical data
Establishes and enforces world-class data standards, quality control processes, and governance frameworks (FAIR principles) to ensure data integrity, reliability, and reusability across all VIDRU research initiatives, from lab bench to final analysis, in collaboration with Research Technologies
Drives the development and application of advanced analytical methodologies, including deep learning, biomedical computer vision, and predictive modeling, to extract deep biological and clinical insights from integrated datasets. This includes promoting collaborative knowledge sharing and aligning with tech providers on emerging innovations
Collaborates closely with DPLs, VDLs, PILs, TPLs, and clinical sciences teams, as well as lab scientists within Discovery Technologies and scientific areas, to understand their data needs and deliver integrated datasets robust, scalable analytical workflows
Partners with experimental scientists to optimize VIDRU data flows, ensuring high-quality data generation aligned with FAIR principles from the outset of experiments
Drives innovation in research data integration and predictive analytics by partnering closely with GSK's AIML, Research Tech and R&D Tech organizations, leveraging product-grade software development practices to scale successful research pipelines into reusable and sustainable assets
Ensures that data and analytical deliverables are at the highest research and industry standards regarding scientific excellence, quality, security, and timelines, translating complex data into actionable insights with reproducibility and reliability
Communicates complex data landscapes, integration strategies, and analytical findings effectively to internal and external stakeholders, acting as a bridge between biologists, data scientists, and IT engineers. This role also involves mentoring scientists in leveraging LLMs and other digital tools for research and development breakthroughs
Contributes to the definition and implementation of VIDRU Data Science scientific strategy, processes, and objectives, ensuring alignment with the Head of VIDRU Data Sciences and the overall GSK Vaccines & Infectious Diseases R&D strategy, maintaining digital fluency with RTech and the R&D Digital Network
Qualification
Required
PhD or equivalent experience. Data Science, Computer Science, Bioinformatics, Computational Biology, Statistics, Engineering, Scientific Software Engineering, or equivalent, with a strong focus on data systems, advanced analytics, and robust software development practices in a biomedical context
Established background and practical experience in designing, building, and managing complex data architectures, applying advanced AI/ML techniques, and deriving scientific insights from bio-clinical data, with a proven track record in developing scalable and reproducible scientific workflows
Publication record in relevant areas, demonstrating leadership in establishing integrated data environments and delivering impactful data-driven insights from datasets in an R&D setting
8+ years of relevant scientific experience, including four years of direct/matrix people management and international leadership responsibilities (e.g. principal investigator for international R&D projects in relevant areas)
Proven capacity to use the theoretical background and education to solve actual problems of R&D projects working and leading teams in cross-functional setting. This role shall act as a global reference person for the function and perform people management and coaching of global staff
Preferred
Demonstrated strong proficiency / publication record in one or more of the following areas: Designing and implementing robust, product-grade data systems and integration pipelines (e.g., cloud computing) for diverse bio-clinical datasets in cloud systems and HPC environments
Developing and enforcing FAIR data governance frameworks and data quality standards for scientific research data
Advanced AI/ML techniques including deep learning, biomedical computer vision, and predictive modeling on clinical and molecular data
Expertise in managing, integrating, and analyzing multi-omics data (genomics, transcriptomics, proteomics) and associated metadata, with deep understanding of data types like FASTQ, BAM, VCF
Molecular biology insight applied to data interpretation and model development, with familiarity with laboratory processes and experimental design
Proficiency in cloud-based data platforms and technologies (e.g., GCP, Azure) for large-scale scientific data processing and analytics
Strong programming skills in languages relevant to data systems and advanced analytics (e.g., Python, R)
Experience with version control and automated testing for scientific software development
Benefits
Health care and other insurance benefits (for employee and family)
Retirement benefits
Paid holidays
Vacation
Paid caregiver/parental and medical leave
Company
GSK
We are uniting science, technology and talent to get ahead of disease together. Our community guidelines: https://gsk.to/socialmedia
H1B Sponsorship
GSK has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (45)
2024 (56)
2023 (54)
2022 (53)
2021 (54)
2020 (72)
Funding
Current Stage
Public CompanyTotal Funding
$25.51MKey Investors
CARB-X
2021-03-02Grant· $18M
2020-09-23Grant· $7.51M
1978-01-13IPO
Recent News
2026-01-16
2026-01-13
South China Morning Post
2026-01-08
Company data provided by crunchbase