Prolaio · 14 hours ago
Data Science Intern
Prolaio is creating smarter ways to address heart disease and heart risks through a connected platform enabled by smart data science. As a Data Science Intern, you will develop and validate pipelines for extracting clinical endpoints from Electronic Health Record data, contributing to the improvement of patient care and outcomes.
AnalyticsHealth CareMedical
Responsibilities
Develop Python/LLM workflows, including workflows built on purpose-built clinical extraction tools, to ingest unstructured xCures data (clinical notes, discharge summaries) and extract key study endpoints, specifically Clinical Events or 'Unified Problem Lists'
Design and conduct a human review validation study comparing LLM-generated abstractions against a 'gold standard' dataset derived from manual chart review
Build and maintain a documented code repository that inputs raw xCures EHR data and outputs structured clinical datasets for study data
Analyze pipeline performance to establish concordance, sensitivity, and specificity metrics, delivering a final validation report with performance metric for multiple approaches
Collaborate with clinical and technical mentors to translate clinical requirements into technical solutions
Qualification
Required
Currently enrolled in a Master's or graduate-level program in Computer Science, Data Science, Biomedical Informatics, Bioengineering, Computational Biology, or a related field
Strong proficiency in Python programming with experience using Large Language Model (LLM) APIs
Familiarity with Natural Language Processing (NLP) concepts, specifically Prompt Engineering
Experience handling unstructured text data, cleaning messy real-world data, and/or working with human evaluation datasets
Ability to handle edge cases in text (e.g., negation) and validate one's own output using standard validation metrics
Preferred
A basic understanding of clinical terminology, Electronic Health Records (EHR), or biomedical data is highly preferred
Company
Prolaio
Prolaio is a HealthTech company that focuses on prescriptive analytics.
H1B Sponsorship
Prolaio has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)
2024 (2)
Funding
Current Stage
Growth StageTotal Funding
$25.5M2025-03-27Acquired
2023-07-26Series Unknown· $25.5M
Recent News
2025-10-03
Company data provided by crunchbase