Data Science - Agentic AI, Document Understanding Co-op jobs in United States
cer-icon
Apply on Employer Site
company-logo

Ancestry · 13 hours ago

Data Science - Agentic AI, Document Understanding Co-op

Ancestry is a human-centered company that connects people with their family history. They are seeking a highly motivated Agentic AI, Document Understanding Co-op to design and implement AI systems that extract and organize information from historical records, working closely with engineering teams to optimize and deploy solutions.

E-CommerceFamilyInternetSubscription Service
check
H1B Sponsor Likelynote

Responsibilities

Innovate with State-of-the-Art AI: Implement cutting-edge AI solutions for key Document Understanding tasks such as OCR/HTR, transcription, Named Entity Recognition (NER), Relation Extraction (RE), Coreference Resolution, Summarization, and Knowledge Graphs working with diverse genealogical and historical collections spanning newspapers, city directories, family history books, and vital records (i.e., birth, marriage, & death records)
Analyze and Optimize Multi-Modal Models: Evaluate the performance of multi-modal models in zero-shot and few-shot learning scenarios for comprehensive document understanding
Architect Agentic Systems: Design and implement multi-agent workflows using frameworks like LangChain, LangGraph, CrewAI, or AutoGen to automate complex multi-step reasoning tasks in historical document analysis
Evaluation & Observability: Establish 'LLM-as-a-Judge' frameworks and use tools like Arize Phoenix, DeepEval, or RAGAS to monitor for hallucination, drift, and bias
Collaborate on Cloud Deployment: Partner closely with ML Ops and Data Science Engineers to seamlessly deploy datasets, models, and pipelines in cloud environments
Communicate Insights Effectively: Clearly and confidently present your findings, deliverables, and proposed solutions to technical and non-technical audiences, including teams, stakeholders, and executives

Qualification

AI & LLMsPythonDocument UnderstandingMulti-agent workflowsCloud platformsCommunication skillsCollaboration

Required

Currently pursuing an advanced degree (Master's or PhD) in Computer Science, Data Science, Statistics, Mathematics, Linguistics, Engineering or related quantitative field with a strong data focus
Specialization in AI & LLMs including familiarity with foundational models such as GPT, Gemini, Qwen, Llama, Claude, etc
Experience with inference optimization, vLLM, LoRA, QLoRA, quantization, etc
Familiar with embeddings, vector databases, transformer models, with software development experience
Strong proficiency in Python and relevant tools and libraries, including transformer models, multi-modal models, and general NLP (e.g., Hugging Face Transformers, agentic frameworks and workflows, LangChain, LangGraph, CrewAI, AgentCore)

Preferred

Master's or PhD preferred in Computer Science, Data Science, Statistics, Mathematics, Linguistics, Engineering or related quantitative field with a strong data focus
Familiarity with cloud platforms and related AI/ML services such as Google Cloud Platform, GCP, Gemini API, Vertex AI, AWS EC2, S3, SageMaker, Model Registry, and Bedrock

Company

Ancestry

company-logo
Ancestry is a web-based platform that helps its users to create their own family tree and help them preserve and share their family history.

H1B Sponsorship

Ancestry has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (61)
2024 (60)
2023 (65)
2022 (99)
2021 (60)
2020 (47)

Funding

Current Stage
Public Company
Total Funding
$33.2M
Key Investors
Banneker Partners
2020-08-05Acquired
2016-04-01Post Ipo Equity
2012-10-01Post Ipo Equity

Leadership Team

leader-logo
Sriram Thiagarajan
EVP, Chief Technology Officer & Chief Information Officer
linkedin
leader-logo
Attica Alexis Jaques
SVP and GM US Marketing
linkedin
Company data provided by crunchbase