Large Language Model Integration Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Sphere · 16 hours ago

Large Language Model Integration Engineer

Sphere is partnering with a major media holding to transform their workflow with thousands of legal and financial documents. The role involves building an intelligent assistant for deep document analysis and interaction, focusing on the design and optimization of a Retrieval-Augmented Generation system and the integration of LLM capabilities into existing platforms.

AnalyticsBusiness IntelligenceCloud Data ServicesConsultingDeveloper ToolsMobile AppsSoftwareUX Design
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Design, build, and optimize a production-grade Retrieval-Augmented Generation (RAG) system to ground LLM responses in the client's proprietary document repositories
Fine-tune and manage open-source or cloud-based LLMs using techniques like LoRA or prompt engineering to excel at domain-specific tasks (legal/financial analysis)
Develop, containerize, and deploy robust API endpoints to integrate LLM capabilities seamlessly into the client's existing SaaS platforms and internal tools
Architect and implement monitoring for LLM performance, cost, and latency; optimize inference pipelines using techniques like model quantization and caching
Establish guardrail systems to evaluate output accuracy, mitigate hallucinations, and maintain audit trails of model decisions for compliance

Qualification

NLPLarge Language ModelsPythonLLM frameworksVector databasesCloud platformsAPI developmentEvaluation frameworks

Required

Experience with NLP and Large Language Models (3+ years in production settings)
Experience with LLM frameworks and orchestration tools (LangChain, LlamaIndex, Hugging Face transformers)
Practical knowledge of vector databases (Pinecone, Weaviate, pgvector) and embedding strategies
Experience with Python and experience building and deploying scalable APIs (5+ years in production settings)
Familiarity with cloud platforms (AWS, GCP, Azure) for MLOps

Preferred

A proven track record of successfully deploying a RAG system or LLM-powered application to production
Experience implementing evaluation frameworks and metrics for generative AI outputs

Company

Sphere

twittertwittertwitter
company-logo
Drive your sustainable digital transformation with focus on innovation and scale

H1B Sponsorship

Sphere has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2021 (3)
2020 (1)

Funding

Current Stage
Growth Stage

Leadership Team

leader-logo
Leon Ginsburg
CEO/Founder
linkedin
leader-logo
Alex Korenev
Community Engagement Partner
linkedin
Company data provided by crunchbase