Lead, Research Data Scientist - Cancer, Dr. Cullen lab jobs in United States
cer-icon
Apply on Employer Site
company-logo

Houston Methodist · 3 hours ago

Lead, Research Data Scientist - Cancer, Dr. Cullen lab

Houston Methodist is a leading academic institute focused on advancing healthcare through innovative research. The Lead Research Data Scientist will oversee the integration and management of clinical and research data for the Cancer Prevention and Control Program, ensuring compliance and data quality while facilitating research collaborations and optimizing data pipelines.

Health CareMedical
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Design, implement, and maintain a state-of-the-art data management infrastructure that includes robust DataMart from multiple novel sources reflecting medical, social, genomic, and environmental drivers of health, consistent with the departmental mission
Engage in strategic negotiations and collaborations with key data providers, including institutional EMR teams, the Texas Cancer Registry, and other state and local agencies, to secure access to data critical to understanding the medical and socioecological contexts of health in patients with cancer. Liaise with multidisciplinary teams to ensure seamless data flow, integration, and alignment with project objectives
Oversee strategic guidance on data elements, methods, and models required for clinical and research objectives, and facilitate the identification and extraction of these elements from the EHR and data warehouse systems
Oversee data collection and harmonization processes, ensuring compliance with applicable industry data standards (e.g., FHIR, HL7, NAACCR)
Collaboratively develop and deploy quality assurance (QA) and quality control (QC) measures to maintain data integrity and reliability
Apply statistical and epidemiological methods to analyze data on health outcomes and survival in patients with cancer. Develop and adhere to statistical analysis plans (SAP), prepare research presentations, and draft results for peer-reviewed publications
Monitor emerging trends and developments in real-world clinico-omic data informatics and provide strategic recommendations on technologies and processes to optimize and advance the CPC data ecosystem
Prepare and present data insights, integration strategies, and progress reports to internal and external stakeholders
Enhance scientific discovery by contributing to hypothesis generation, cohort identification, publications, presentations, and proposals that advance CPC program goals
Establish a streamlined process for research approval that includes developing an umbrella IRB protocol and standardized data-use agreements for the novel data warehouses to facilitate quicker start times for research projects
Manage an expanding team with varied expertise within the Clinical Informatics and Data Sciences Group. Train, mentor, and supervise staff on data workflow processes, good practices, and standards
Strong understanding of data interoperability standards (e.g., FHIR, HL7, OMOP, CDISC, NAACCR)
Knowledge of regulatory frameworks such as HIPAA, GDPR, and GCP
Familiarity with cancer genomics repositories and/or cancer registries is highly desirable
Proficiency with data integration tools and platforms (e.g., ETL processes, APIs, databases)
Knowledge of programming languages or tools commonly used in data analysis and integration (e.g., Python, R, SQL, Tableau)
Knowledge of EPIC, PACS, LIS, and other HealthIT/business systems
Deep understanding of AI lifecycle Management, AI/ML Platform architecture
Familiarity with cloud-based data environments (e.g., AWS, Google Cloud, or Azure)
Familiarity with terminology systems, including SNOMED, LOINC, RxNorm, and ICD, to encode and query healthcare data
Manages multiple client/researcher relationships by leading the delivery of innovative strategies and solutions for our clients. Assigns projects to other data scientists
Oversees project progress and presents results to the stakeholders
Mentors and guides less experienced members of the data-science and analytics teams in their development of skills and proficiencies to be more effective in their roles
Drives contributions towards improvement of department scores for employee engagement, i.e., peer-to-peer accountability
Leads the research data analytics insights across the hospital system
Leads/co-leads a data science/research/business team targeted at fulfilling the organization's needs to access and understand research, clinical, financial, operational, population health, social, behavior, and marketing data
Leads a portfolio of advanced and complex projects that require more experience and expertise. Assigns the execution of multiple, complex analytical plans and projects and leads lower-level data scientists
Leads, as subject matter expert, on clinical data quality issues and assurance as well as patient privacy and data security and compliance
Oversees documenting the project requirements and results on a timely basis and reviews documentation and code submitted by lower-level research data scientists
Facilitates quality improvement process evaluations to improve data research analytics. Reviews presentation ideas prior to being viewed by leadership
Provides guidance to HDSA leadership on strategic and infrastructural needs for the organization
Leads the continuous improvement of clinical and operational data and analytics
Actively seeks out innovative solutions to the many challenges facing researchers in the clinical outcomes space and explores solutions to collaboration across and external to the institution. solutions with the clinicians, hospital administration, and other stakeholders
Translates the findings into implementable informatics solutions with the clinicians, hospital administration, and other stakeholders, leading assisting the Technical and Research Data Scientists with these tasks
Seeks opportunities to identify self-development needs and takes appropriate action. Ensures own career discussions occur with appropriate management. Completes and updates the My Development Plan on an on-going basis

Qualification

Data management infrastructureStatistical analysisData integration toolsHealth data standardsProgramming languagesAI/ML Platform architectureCloud computingCancer genomics knowledgeResearch collaborationQuality assuranceTeam managementCommunication skills

Required

Master's degree in computer science, Clinical Informatics, Public Health Administration, Business Administration, or Data Science, or Engineering, or an MD with experience in Informatics and Statistics or related field
Seven years' experience in health care data analytics and/or database management in a healthcare organization
Advanced skill in fields of computer science or mathematics and applications, modeling, statistics, and analytics
Strong understanding of data interoperability standards (e.g., FHIR, HL7, OMOP, CDISC, NAACCR)
Knowledge of regulatory frameworks such as HIPAA, GDPR, and GCP
Familiarity with cancer genomics repositories and/or cancer registries is highly desirable
Proficiency with data integration tools and platforms (e.g., ETL processes, APIs, databases)
Knowledge of programming languages or tools commonly used in data analysis and integration (e.g., Python, R, SQL, Tableau)
Knowledge of EPIC, PACS, LIS, and other HealthIT/business systems
Deep understanding of AI lifecycle Management, AI/ML Platform architecture
Familiarity with cloud-based data environments (e.g., AWS, Google Cloud, or Azure)
Familiarity with terminology systems, including SNOMED, LOINC, RxNorm, and ICD, to encode and query healthcare data
Demonstrates the skills and competencies necessary to safely perform the assigned job, determined through on-going skills, competency assessments, and performance evaluations
Sufficient proficiency in speaking, reading, and writing the English language necessary to perform the essential functions of this job, especially with regard to activities impacting patient or employee safety or security
Ability to effectively communicate with patients, physicians, family members and co-workers in a manner consistent with a customer service focus and application of positive language principles
Demonstrated advanced proficiency in SQL, Python, R or other common data-science tools and languages
Strong research and business acumen
Mastery in ability to communicate findings to internal and external business and IT leaders in a way that can influence how an organization approaches a business challenge
Maintains current Human Subjects Research credentials as defined by the Houston Methodist Research Institute

Preferred

PhD degree preferred
Experience with cloud computing and working with big-data and common health care data models strongly preferred
Credentials or experience working with AWS and/or i2b2 or other common clinical data models strongly preferred
EPIC - Certification (EPIC) -- Current Epic Certification or Proficiency in Clarity, Caboodle and Cogito preferred

Company

Houston Methodist

company-logo
Houston Methodist is one of the nation’s leading health systems and academic medical centers.

H1B Sponsorship

Houston Methodist has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (15)
2024 (11)
2023 (14)
2022 (12)
2021 (10)
2020 (10)

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Brooke Graham
CEO Project Director
linkedin
leader-logo
David P. Bernard
Chief Executive Officer & Senior Vice President
linkedin
Company data provided by crunchbase