Data Scientist with NLP jobs in United States
cer-icon
Apply on Employer Site
company-logo

VBeyond Corporation ยท 3 days ago

Data Scientist with NLP

VBeyond Corporation is seeking a Data Scientist with expertise in NLP and healthcare applications. The role involves analyzing clinical textual data using AI-powered NLP techniques and developing NLP modules, while collaborating with engineering teams to enhance data workflows.

ConsultingCRMDeliveryHuman ResourcesInformation Technology

Responsibilities

Analyze and process clinical textual data using AI-powered NLP techniques and advanced machine learning models
Modify and improve current workflows by incorporating cutting-edge machine learning and deep learning algorithms, including leveraging large language models (LLMs) and tools like LangGraph for complex AI agentic workflows in healthcare contexts
Develop NLP modules within the NLP development team using programming or scripting languages such as Python
Conduct pre-processing and quality analysis for textual data inputs and validate performance of NLP outputs
Create systematic testing procedures, error-checking mechanisms, and user manuals for NLP modules
Build infrastructure for optimal extraction, transformation, and loading of data from diverse sources including MCP servers, using SQL and AWS big data frameworks such as EMR and Spark/pySpark
Collaborate with Engineering teams to ensure scalable and efficient data workflows using SQL and AWS big data technologies
Apply working knowledge of AWS services, particularly AWS Bedrock, to develop generative AI applications
Utilize relational databases such as PostgreSQL or MySQL for data storage and retrieval in NLP and AI workflows

Qualification

NLPAWS BedrockPythonSQLMachine learningDeep learningAWS EMRSpark/pySparkHealthcareLangGraphPostgreSQLMySQLGenerative AIHL7FHIRCCDAAutomated testingLangChainDocumentationCollaboration

Required

Expertise in Fine tuning using AWS Nova
Proficiency in Python and scripting languages for NLP and machine learning development
Hands-on experience with large language models and agentic workflow tools such as LangGraph
Strong understanding of clinical NLP techniques and experience with machine learning and deep learning models
Expertise in SQL and big data technologies including AWS EMR and Spark/pySpark
Practical knowledge of AWS services, especially AWS Bedrock for generative AI applications
Experience with relational databases such as PostgreSQL or MySQL

Preferred

Familiarity with generative AI applications in healthcare and related use cases
Understanding of healthcare data standards and terminologies such as HL7, FHIR, and CCDA
Experience in creating detailed documentation, user manuals, and technical specifications
Background in automated testing and validation frameworks for NLP outputs
Ability to collaborate effectively with cross-functional teams including engineering and products
Exposure to LangChain or similar frameworks for building intelligent agent workflows

Company

VBeyond Corporation

twittertwittertwitter
company-logo
VBeyond Corporation is a staffing and recruiting company specializing in emerging search and HR consulting services.

Funding

Current Stage
Growth Stage

Leadership Team

leader-logo
Rajesh Khanna
President
linkedin
leader-logo
Sandeep Mitra
Director
linkedin
Company data provided by crunchbase