This job has closed.

RIT Solutions, Inc. · 8 hours ago

AI Engineer

RIT Solutions, Inc. is seeking an experienced AI Engineer to design, build, and operate AI/ML infrastructure and agentic systems. The role involves developing MCP servers and agents, integrating LLMs, and implementing RAG pipelines for production environments.

Staffing & Recruiting
H1B Sponsor Likely

Responsibilities

Design, build and operate MCP servers and MCP agents that host, orchestrate and monitor AI/agent workloads
Develop agentic AI, prompt engineering patterns, LLM integrations and developer tooling for production use
Own deployment, scaling, reliability and cost-efficiency on Kubernetes/Docker and Google Cloud with automated CI/CD
Design and implement RAG (Retrieval‐Augmented Generation) pipelines and integrations with vector stores and retrieval tooling; use LangChain and Langfuse for orchestration, chaining, and observability
Implement and maintain MCP server and agent code, APIs, and SDKs for model access and agent orchestration
Design agent behavior, workflows and safety guards for agentic AI systems
Create, test and iterate prompt templates, evaluation harnesses and grounding/chain‐of‐thought strategies
Integrate LLMs and model providers (self‐hosted and cloud APIs) with unified adapters and telemetry
Build developer tooling: CLI, local runner, simulators, and debugging tools for agents and prompts
Containerize services (Docker), manage orchestration (Kubernetes/GKE), and optimize nodes, autoscaling and resource requests
Ensure observability: logging, metrics, traces, dashboards, alerting and SLOs for model infra and agents
Create runbooks, playbooks and incident response procedures; reduce MTTR and perform postmortems
Design and maintain RAG workflows: document chunking, embeddings, vector indexing, retrieval strategies, re‐ranking and context injection
Integrate and instrument LangChain for composable chains, agents and tooling; use Langfuse (or equivalent tracing) to capture prompts, model calls, RAG traces and evaluation telemetry

Qualifications

AI/ML infrastructure, Kubernetes, Docker, Python, LLMs, RAG implementation, LangChain, Google Cloud Platform, CI/CD, Prompt engineering, Security best practices, Testing, Observability, Soft skills

Required

5+ years of strong software engineering (Python/Node.js), system design, and production service experience
2+ years of experience with LLMs, prompt engineering, and agent frameworks
2+ years of practical experience implementing RAG: embeddings, vector DBs, and retrieval tuning
2+ years of experience with LangChain patterns and with toolchain telemetry (Langfuse or similar) for prompt/model traceability
5+ years of experience with Kubernetes, Docker, CI/CD, and infrastructure‐as‐code
2+ years of practical experience with Google Cloud Platform services
2+ years of experience with observability, testing, and security best practices for distributed systems
2+ years of experience evaluating and mitigating retrieval/augmentation failures, hallucinations, and leakage risks in RAG systems
Familiarity with vendor and open‐source vector stores and embedding providers
Familiarity with CI/CD pipelines (Jenkins, GitHub Actions, GitLab CI, or ArgoCD)

Company

RIT Solutions, Inc.

Jobdiva Job Portal: https://www1.jobdiva.com/candidates/myjobs/searchjobsdone.jsp?a=xbjdnwgjodtga1y1im2g881fkkeiwd0775lbvq8yqgps8vb2q36w2vj1ga6xxork&compid=-1
Recruitment (contingency search and campus selection).

H1B Sponsorship

RIT Solutions, Inc. has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is presented below for reference. (Data powered by the US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship (chart not shown)
Trends of Total Sponsorships: 2025 (1), 2023 (2)

Funding

Current Stage
Growth Stage
Company data provided by Crunchbase