CES · 1 week ago
Senior AI Engineer 2904(Remote)
CES is a company with over 26 years of experience in delivering Software Product Development and Digital Transformation Consulting Services. They are seeking a hands-on AI Engineer to design and ship customer-facing AI features powered by modern LLMs, collaborating with various teams to develop reliable and measurable AI solutions.
Cyber SecurityInformation Technology
Responsibilities
Own end-to-end development of LLM features: problem framing, data prep, prototyping, offline/online evaluation, deployment, and monitoring
Build retrieval-augmented generation (RAG) pipelines with vector search (e.g., FAISS, Pinecone, OpenSearch/KNN) and document orchestration
Implement prompt strategies, tool use/function calling, and guardrails for safety, bias, and privacy
Integrate models in production services (REST/GraphQL/gRPC), including auth, rate limiting, and observability
Stand up evals and experiment frameworks (A/B tests, golden sets, regression suites) with clear success metrics
Optimize for latency, cost, and quality: prompt compression, caching, model selection, fine-tuning/LoRA, distillation where appropriate
Collaborate with DevOps/MLOps/Platform to automate CI/CD, data/version management, and feature flags
Embed with CX/Support to mine tickets, chats, and call transcripts; convert VOC into training/eval datasets and backlog priorities
Instrument user journeys and define online/offline evals (win rate, hallucination rate, TTR, CSAT/NPS); run A/B tests and ship iterative improvements
Build feedback loops (thumbs-up/down, rationale capture, escalation) and human-in-the-loop fallbacks that protect quality
Own reliability and UX details that matter for customers: latency budgets, safe fallbacks, clear handoff to human agents, accessibility
Partner with Trust/Legal/Security to ensure privacy-by-design and compliant data handling; implement guardrails and red-team mitigations
Document designs and teach best practices to engineering partners
Ship 1–2 LLM features to production with SLAs, monitoring, and rollback plans
Establish an eval harness (offline + online) and quality gates for prompts/RAG
Reduce average latency/cost per request by ≥20% without quality regression
Create internal runbooks and dashboards for reproducibility and troubleshooting
Qualification
Required
4–6 years in applied ML/AI or backend engineering with measurable production impact
Strong Python and software engineering fundamentals (testing, types, CI/CD)
Practical LLM experience: OpenAI/Anthropic, or cloud providers (AWS Bedrock, Azure OpenAI, GCP Vertex)
Experience with at least one deep learning or LLM framework (PyTorch, Transformers, vLLM) and one orchestration library (LangChain, LlamaIndex, Guidance, or custom)
RAG and data pipelines: chunking/embedding strategies, vector DBs, metadata filtering, and document QA
Monitoring/telemetry for AI systems (e.g., MLflow, Weights & Biases, Prometheus, custom eval dashboards)
Security & privacy awareness (PII handling, redaction, data retention)
Model customization (fine-tuning/LoRA) and synthetic data generation
Streaming and toolcalling/agents, structured outputs (JSON, function schemas)
Cloud & MLOps: AWS (SageMaker/Bedrock/Lambda), Docker, Terraform, Kubernetes
Frontend integration patterns for AI UX (streaming UIs, fallbacks, user feedback loops)
Domain experience in compliance-heavy environments (e.g., education, finance, healthcare)
Benefits
Flexible working hours to create a work-life balance.
Opportunity to work on advanced tools and technologies.
Global exposure to not only collaborate with the team, but also to connect with the client portfolio and build professional relationships.
Highly encouraged for any innovative ideas & thoughts and we support in executing the same.
Periodical and on-spot rewards and recognitions on your performance.
Provides a better platform for enhancing skills via many different L&D programs.
Enabling and empowering atmosphere to work along.
Company
CES
CES is an information technology and business process management company.
H1B Sponsorship
CES has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (14)
2024 (10)
2023 (7)
2022 (13)
2021 (11)
2020 (15)
Funding
Current Stage
Late StageRecent News
shropshirestar.com
2025-09-20
The Hollywood Reporter
2025-05-17
shropshirestar.com
2025-04-30
Company data provided by crunchbase