Large Language Model Integration Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Sphere · 21 hours ago

Large Language Model Integration Engineer

Sphere is partnering with a major media holding to transform their workflow with legal and financial documents by building an intelligent assistant for deep document analysis. The Large Language Model Integration Engineer will design and optimize a Retrieval-Augmented Generation system, fine-tune LLMs, and integrate LLM capabilities into existing platforms.

AnalyticsBusiness IntelligenceCloud Data ServicesConsultingDeveloper ToolsMobile AppsSoftwareUX Design
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Design, build, and optimize a production-grade Retrieval-Augmented Generation (RAG) system to ground LLM responses in the client's proprietary document repositories
Fine-tune and manage open-source or cloud-based LLMs using techniques like LoRA or prompt engineering to excel at domain-specific tasks (legal/financial analysis)
Develop, containerize, and deploy robust API endpoints to integrate LLM capabilities seamlessly into the client's existing SaaS platforms and internal tools
Architect and implement monitoring for LLM performance, cost, and latency; optimize inference pipelines using techniques like model quantization and caching
Establish guardrail systems to evaluate output accuracy, mitigate hallucinations, and maintain audit trails of model decisions for compliance

Qualification

NLPLarge Language ModelsLLM frameworksOrchestration toolsPythonVector databasesCloud platformsRAG system deploymentEvaluation frameworks for AI

Required

Experience with NLP and Large Language Models (3+ years in production settings)
Experience with LLM frameworks and orchestration tools (LangChain, LlamaIndex, Hugging Face transformers)
Practical knowledge of vector databases (Pinecone, Weaviate, pgvector) and embedding strategies
Experience with Python and experience building and deploying scalable APIs (5+ years in production settings)
Familiarity with cloud platforms (AWS, GCP, Azure) for MLOps

Preferred

A proven track record of successfully deploying a RAG system or LLM-powered application to production
Experience implementing evaluation frameworks and metrics for generative AI outputs

Company

Sphere

twittertwittertwitter
company-logo
Drive your sustainable digital transformation with focus on innovation and scale

H1B Sponsorship

Sphere has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2021 (3)
2020 (1)

Funding

Current Stage
Growth Stage

Leadership Team

leader-logo
Leon Ginsburg
CEO/Founder
linkedin
leader-logo
Alex Korenev
Community Engagement Partner
linkedin
Company data provided by crunchbase