Senior Data Scientist (NLP) jobs in United States
cer-icon
Apply on Employer Site
company-logo

Clarivate · 2 days ago

Senior Data Scientist (NLP)

Clarivate is seeking a Senior Data Scientist specializing in Natural Language Processing (NLP) to join their Life Sciences & Healthcare team. This role involves designing scalable NLP workflows, implementing indexing and vectorization strategies, and developing prompting and fine-tuning frameworks to enhance AI-driven solutions.

AnalyticsInformation ServicesInformation TechnologyInnovation Management
check
H1B Sponsor Likelynote

Responsibilities

Design NLP Workflows: Develop scalable pipelines for text ingestion, cleaning, normalization, and tokenization to support downstream applications
Implement Indexing and Vectorization Strategies: Architect and maintain robust indexing systems and vector databases for semantic search and retrieval
Develop Prompting and Finetuning Frameworks: Create reusable prompting strategies and lead fine-tuning initiatives for LLMs tailored to business-specific tasks
Build LangChain/LangGraph Applications: Construct dynamic knowledge systems and agentic workflows using LangChain and LangGraph
Integrate Advanced RAG Architectures: Apply VRAG and GraphRAG design patterns to enrich information retrieval and contextual understanding
Conduct Performance Optimization: Perform benchmark testing and model evaluations to improve accuracy, efficiency, and scalability of NLP systems
Collaborate Across Teams: Work closely with engineering, product, and research stakeholders to deliver integrated AI-driven features
Provide Technical Leadership: Mentor junior data scientists, guide best practices, and drive innovation across AI projects

Qualification

Natural Language ProcessingPythonLangChainLangGraphRetrieval-Augmented GenerationMachine LearningEmbedding ModelsSemantic SearchCloud PlatformsMLOpsGraph Neural NetworksMultilingual NLPTechnical Leadership

Required

Bachelor's degree in Computer Science, Data Science, Computational Linguistics, or a related field
At least 5 years of hands-on experience in data science, focused on natural language processing (NLP)
At least 5 years of experience using Python, with expertise in NLP libraries such as LangChain, LangGraph, or other “Lang”-based toolkits
Proven experience in model development and applying machine learning techniques to real-world problems

Preferred

Expertise in retrieval-based LLM workflows (RAG, VRAG, GraphRAG)
Deep understanding of embedding models, semantic search, and vector stores (e.g., FAISS, Pinecone)
Experience with document loaders and text splitters/document splitting strategies
Familiarity with MLOps practices and production-level deployment of AI pipelines
Experience with cloud platforms (e.g., AWS, Azure, or GCP)
Experience applying Graph Neural Networks (GNNs) to retrieval-enhanced generation
Knowledge of LangSmith and vector orchestration platforms
Familiarity with multilingual NLP and cross-lingual embeddings
Exposure to real-time knowledge graphs and stream-based RAG systems
A Master's or PhD in a technical field (Computer Science, Data Science, etc.)

Benefits

Medical
Dental
Prescription drug
Life insurance
401k with match
Long term disability coverage
Vacation
Sick time
Volunteer time
Discount programs
And many more

Company

Clarivate

company-logo
Clarivate is a leading global provider of transformative intelligence.

H1B Sponsorship

Clarivate has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (16)
2024 (18)
2023 (12)
2022 (16)
2021 (20)
2020 (15)

Funding

Current Stage
Public Company
Total Funding
$94M
Key Investors
Elliott Management Corp.
2022-11-15Post Ipo Equity· $94M
2021-02-01IPO
2016-10-04Acquired

Leadership Team

leader-logo
Jonathan Collins
Executive Vice President and Chief Financial Officer
linkedin
leader-logo
Ketan Patel
Vice President, Cortellis Product Platform
linkedin
Company data provided by crunchbase