Senior AI / Agentic AI & LLM Engineer (Lead) – Python, vLLM / TGI jobs in United States
cer-icon
Apply on Employer Site
company-logo

Halian | Managed Services, Recruitment Agency & Contract Staffing · 4 days ago

Senior AI / Agentic AI & LLM Engineer (Lead) – Python, vLLM / TGI

Halian is partnering with a well-funded, AI-driven technology company operating in highly regulated healthcare and insurance environments across the United States. They are seeking a highly skilled Senior AI / LLM Lead Engineer to take ownership of end-to-end AI initiatives, from architecture and design through to hands-on implementation and production deployment.

EnterpriseInformation TechnologyProfessional Services
badNo H1Bnote
Hiring Manager
Marlon Bannister
linkedin

Responsibilities

Lead end-to-end AI and LLM initiatives, from problem definition and system architecture to production deployment
Remain hands-on in building, testing, and optimising LLM pipelines, agentic systems, and ML workflows
Design and implement agentic and multi-agent systems for real-world applications
Develop tailored LLM pipelines using frameworks such as LangChain, LangGraph, and related ecosystems
Deploy and optimise open-source LLMs using vLLM, TGI, and Python-based inference stacks
Fine-tune, evaluate, and integrate open-source models including Qwen, Llama, Mistral, and similar families
Build and maintain OCR and document intelligence pipelines, including layout-aware processing and post-OCR normalisation
Design and optimise document chunking strategies for embeddings, retrieval, and long-context reasoning
Build and operate large-scale embeddings and vector search pipelines
Apply unsupervised learning techniques on tabular data for clustering, similarity analysis, and anomaly detection
Provide technical leadership, mentorship, and best-practice guidance to engineers
Collaborate with cross-functional teams on architecture, performance goals, and delivery standards

Qualification

PythonAgentic AILLM pipelinesVLLMLangChainLangGraphUnsupervised learningTechnical leadershipCross-functional collaboration

Required

Strong proficiency in Python
Deep experience with Agentic AI, autonomous agents, and orchestration frameworks
Hands-on experience building and deploying LLM pipelines and multi-agent systems
Experience with vLLM, TGI, and LLM serving infrastructure
Extensive experience with open-source LLM ecosystems
Experience with LangChain, LangGraph, or similar frameworks
Strong understanding of chunking strategies, embeddings, retrieval quality, and hallucination control
Experience building embeddings pipelines, vector indexing, and retrieval workflows
Practical experience applying unsupervised learning techniques (e.g. clustering, similarity analysis, anomaly detection)
Must be based in the United States
Must have valid US work authorisation (no sponsorship available)

Company

Halian | Managed Services, Recruitment Agency & Contract Staffing

twittertwittertwitter
company-logo
Halian, with nearly 30 years of experience across the Middle East, Europe, and the US, is dedicated to shaping the future of Workforce Management and Managed Services.