AI Engineer - LLMs, Agents & RAG H/F jobs in United States
cer-icon
Apply on Employer Site
company-logo

EY · 3 hours ago

AI Engineer - LLMs, Agents & RAG H/F

EY exists to build a better working world, helping to create long-term value for clients, people and society. As an AI Engineer within the internal development team, you will design and implement the intelligence layer of multi-agent systems, collaborating to build and deploy secure AI agents powered by LLMs and Retrieval-Augmented Generation architectures on Azure.

AccountingAdviceBusiness IntelligenceConsultingFinancial ServicesProfessional Services
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Develop and maintain LLM- and RAG-based AI agents using Azure OpenAI, LangChain/Semantic Kernel, and Azure ML
Collaborate with the Full Stack Developer to integrate agent endpoints into web and backend applications
Work with DevSecOps to ensure secure deployment, monitoring, and version control of AI components
Implement vector search pipelines (Azure Cognitive Search, FAISS, or Pinecone)
Optimize model inference for latency, accuracy, and scalability
Participate in daily standups, sprint reviews, and code reviews as part of the dev team

Qualification

PythonLLMsRAG designAzure AI ServicesVector databasesTransformer architecturesMLOps principlesMetadata-based retrievalCollaboration skills

Required

1–2 years' experience developing AI or NLP applications in Python
Hands-on experience with LLMs, prompt engineering (Crafting and optimizing), and RAG design
Experience designing RAG pipelines for enterprise search or document intelligence
Knowledge of vector databases (e.g., Qdrant, Chroma)
Knowledge of document chunking, embedding models, and context window optimization
Familiarity with metadata-based retrieval and re-ranking strategies
Understanding of agent architectures
Ability to orchestrate multiple agents for collaborative or role-based tasks
Strong understanding of transformer architectures (GPT, LLaMA, Mistral, Claude, etc)
Experience with LLM fine-tuning and prompt engineering
Familiarity with inference optimization, quantization (e.g., bitsandbytes), and deployment techniques
Hands-on experience using OpenAI, Hugging Face Transformers, or LangChain
Knowledge of model evaluation metrics (e.g., perplexity, hallucination rate, factual consistency)
Prior experience deploying LLM-based agents or RAG systems in production is a major plus
Familiarity with Azure AI Services, Azure ML, Azure Functions, and APIs
Understanding of data security, versioning, and MLOps principles
Strong collaboration skills within cross-functional agile teams

Benefits

Support and coaching
Opportunities to develop new skills and progress your career
Freedom and flexibility to handle your role in a way that’s right for you
Continuous learning
Tools and flexibility, so you can make a meaningful impact, your way
Insights, coaching and confidence to be the leader the world needs
Diverse and inclusive culture

Company

EY is building a better working world by creating new value for clients, people, society, the planet, while building trust in the capital markets.

H1B Sponsorship

EY has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (10242)
2024 (9877)
2023 (10966)
2022 (9394)
2021 (5652)
2020 (8849)

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Jonathan Williamson
Chief Operating Officer
linkedin
leader-logo
Abhishek Sen
Partner
linkedin
Company data provided by crunchbase