SIGN IN
Data Science Intern jobs in United States
cer-icon
Apply on Employer Site
company-logo

Prolaio · 18 hours ago

Data Science Intern

Prolaio is creating smarter ways to address heart disease and heart risks through a connected platform enabled by smart data science. As a Data Science Intern, you will develop and validate pipelines for extracting clinical endpoints from Electronic Health Record data, contributing to the improvement of patient care and outcomes.
Big DataHealthcareAnalyticsHealth CareMedical
check
H1B Sponsor Likelynote

Responsibilities

Develop Python/LLM workflows, including workflows built on purpose-built clinical extraction tools, to ingest unstructured xCures data (clinical notes, discharge summaries) and extract key study endpoints, specifically Clinical Events or 'Unified Problem Lists'
Design and conduct a human review validation study comparing LLM-generated abstractions against a 'gold standard' dataset derived from manual chart review
Build and maintain a documented code repository that inputs raw xCures EHR data and outputs structured clinical datasets for study data
Analyze pipeline performance to establish concordance, sensitivity, and specificity metrics, delivering a final validation report with performance metric for multiple approaches
Collaborate with clinical and technical mentors to translate clinical requirements into technical solutions

Qualification

PythonLarge Language Model (LLM)Natural Language Processing (NLP)Electronic Health Records (EHR)Analytical Skills

Required

Currently enrolled in a Master's or graduate-level program in Computer Science, Data Science, Biomedical Informatics, Bioengineering, Computational Biology, or a related field
Strong proficiency in Python programming with experience using Large Language Model (LLM) APIs
Familiarity with Natural Language Processing (NLP) concepts, specifically Prompt Engineering
Experience handling unstructured text data, cleaning messy real-world data, and/or working with human evaluation datasets
Ability to handle edge cases in text (e.g., negation) and validate one's own output using standard validation metrics

Preferred

A basic understanding of clinical terminology, Electronic Health Records (EHR), or biomedical data is highly preferred

Company

Prolaio

twittertwitter
company-logo
Prolaio is a HealthTech company that focuses on prescriptive analytics.

H1B Sponsorship

Prolaio has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)
2024 (2)

Funding

Current Stage
Growth Stage
Total Funding
$25.5M
2025-03-27Acquired
2023-07-26Series Unknown· $25.5M
Company data provided by crunchbase