Apply on Employer Site

ExpertsHub.ai · 6 hours ago

Artificial Intelligence Specialist

Raleigh, NC

Contract

Hybrid

Senior Level

7+ years exp

ExpertsHub.ai is seeking a hands-on AI Architect to design, build, and deploy production-grade Generative AI systems on AWS. The role involves architecting secure, scalable solutions and collaborating with engineering, data, and product teams to deliver LLM-powered applications.

Computer Software

Responsibilities

Architect and implement Generative AI solutions using LLMs (GPT, Claude, Mixtral, etc.)

Design and deploy Retrieval-Augmented Generation (RAG) pipelines for document Q&A and enterprise search

Build semantic search and embedding pipelines using vector databases (FAISS, OpenSearch, Pinecone)

Select and optimize LLM models, prompts, and inference strategies for accuracy, latency, and cost

Implement hallucination mitigation techniques (grounding, prompt constraints, validation layers)

Design secure, scalable architectures on AWS (Bedrock, SageMaker, Lambda, API Gateway, S3)

Fine-tune models using PEFT techniques (LoRA, QLoRA) when required

Partner with MLOps teams to productionize models with CI/CD, monitoring, and rollback

Optimize GenAI systems for cost, latency, and throughput

Collaborate onsite with cross-functional teams (3 days/week in Raleigh)

Qualification

Generative AI & LLMsAWS AI/ML servicesVector databasesPythonMLOps & ProductionPrompt engineeringSemantic embeddingsChunking strategiesModel versioningCI/CD for ML systems

Required

Strong understanding of LLM architectures and inference

Hands-on experience with RAG systems in production

Prompt engineering, temperature/top-p tuning

Knowledge of LoRA / QLoRA / PEFT techniques

Experience mitigating hallucinations and improving factuality

Semantic embeddings (Sentence-BERT, OpenAI, etc.)

Chunking strategies and metadata handling

Vector similarity search (cosine, dot-product)

Vector databases: FAISS, OpenSearch, Pinecone

AWS AI/ML services: Bedrock, SageMaker

Serverless & APIs: Lambda, API Gateway

Data storage: S3, DynamoDB

Security: IAM, KMS, VPC, CloudTrail

Experience designing enterprise-grade, compliant systems

Python (strong)

Experience with LangChain, Haystack, FastAPI (or similar)

Familiarity with async processing and caching layers

Model versioning and monitoring

CI/CD for ML systems

Rollback strategies and drift detection

Performance and cost monitoring

Bachelor's or Master's degree in Computer Science, AI/ML, or related field

7+ years in software/ML engineering, with 2+ years in GenAI/LLMs

Proven experience deploying AI systems to production

Preferred

Experience with knowledge graphs integrated into GenAI

PDF/document ingestion pipelines (OCR, Textract)

Multi-tenant GenAI architectures

Healthcare / Pharma / regulated industry experience

Exposure to self-hosted open-source LLMs

Company

ExpertsHub.ai

At ExpertsHub.ai, we bridge the gap between businesses and top-tier AI experts.

New York, New York, US

11-50 employees

https://www.expertshub.ai

Funding

Current Stage

Early Stage

Company data provided by crunchbase