AI Engineer, .*RAG jobs in United States
cer-icon
Apply on Employer Site
company-logo

Eloquent AI · 5 months ago

AI Engineer, .*RAG

Eloquent AI is a fast-growing global company focused on building autonomous AI agents for regulated industries. As a Senior AI Engineer, .*RAG, you will design and optimize RAG systems that power enterprise AI agents, ensuring they deliver accurate, real-time responses while collaborating with researchers and engineers to enhance AI capabilities.

Enterprise SoftwareFinTechGenerative AISaaS

Responsibilities

Design and implement scalable RAG pipelines that enable AI agents to retrieve and generate knowledge in real time
Develop and optimize knowledge retrieval systems, fine-tuning embeddings, vector search, and ranking models
Work with LLM architectures, applying prompt engineering, fine-tuning, and reinforcement learning techniques to improve response accuracy
Optimize large-scale AI workloads, ensuring low latency and high efficiency for enterprise-grade AI applications
Collaborate with AI researchers to translate state-of-the-art RAG advancements into deployable, high-performing solutions
Leverage cloud infrastructure (AWS, GCP, or Azure) to build distributed, high-availability AI systems
Continuously improve knowledge ingestion, ensuring AI agents stay up-to-date with evolving enterprise datasets

Qualification

RAG architecturesLLM expertisePythonAI frameworksCloud computingNLP techniquesReinforcement LearningCross-functional collaboration

Required

5+ years of software engineering experience, with a focus on AI, NLP, or distributed systems
Strong proficiency in Python and experience with AI frameworks like PyTorch and TensorFlow
Expertise in RAG architectures, including experience with vector databases (e.g., FAISS, Weaviate, Pinecone, Milvus) and document retrieval methods
Familiarity with LLM training, knowledge distillation, and agentic frameworks
Experience with cloud computing and building scalable, production-ready AI applications
Ability to optimize AI models for efficiency, balancing accuracy, latency, and cost
Deep understanding of NLP and IR techniques, including tokenization, embeddings, ranking algorithms, and their evaluation

Preferred

You have published research in AI, NLP, or RAG-related topics at top-tier conferences (NeurIPS, ICML, ICLR, ACL, SIGIR, etc.)
You have experience implementing hybrid RAG pipelines, combining retrieval with multi-step reasoning and tool use
You've worked in high-performance AI teams, scaling AI-driven applications in fast-growth environments
You have experience with Reinforcement Learning from Human Feedback (RLHF) and optimizing LLMs for enterprise use cases
You are comfortable working in cross-functional AI product teams, collaborating with researchers, engineers, and product managers

Company

Eloquent AI

twittertwittertwitter
company-logo
The AI Operator for Financial Services

Funding

Current Stage
Early Stage
Total Funding
$8.4M
Key Investors
Amazon Web ServicesFoundation Capital
2025-10-08Non Equity Assistance· $1M
2025-09-08Seed· $7.4M
Company data provided by crunchbase