AHEAD · 1 month ago
AI Applications Architect, AI Services
AHEAD builds platforms for digital business, focusing on digital transformation through cloud infrastructure and software delivery. The AI Applications Architect is responsible for designing enterprise-grade architectures for agentic AI solutions and leading engineering teams to deliver scalable and reliable systems.
Cloud Computing · Information Technology · Software · Staffing Agency · Virtualization
Responsibilities
Design and own cloud-native architectures (AWS/Azure) for agentic AI workloads using Kubernetes/EKS, Terraform, Docker, serverless APIs, AWS Batch, and async orchestration frameworks (Celery, Step Functions, EventBridge, StoneBranch)
Define agentic system patterns using LangChain, LangGraph, AutoGen, LlamaIndex, and other multi-agent frameworks, alongside vector stores such as Pinecone; ensure consistency of prompt/tool design, memory/state handling, and workflow orchestration
Architect vector database, RAG, and embedding pipelines, as well as model-serving endpoints (LLM/SLM), with a strong emphasis on scalability and latency management
Establish platform-wide standards for API gateway patterns, identity and auth (OAuth2, Cognito, Vault), secrets management, event contracts/schemas, and data governance
Ensure holistic observability across multi-agent systems: tracing, metrics, logging, SLO/SLA definitions, synthetic checks, and incident response playbooks
Lead architecture reviews, threat modeling, and performance benchmarking for agentic workloads
Guide engineering teams through architectural decisions, distributed design principles, and production-readiness standards
Mentor engineers in Kubernetes/EKS, async programming, multi-agent orchestration, cloud-native development, and responsible AI practices
Provide input on hiring, onboarding, and talent development to grow AHEAD’s agentic engineering bench
Partner with Delivery Leads to ensure architecture is executable, scalable, and aligned with timelines
Champion automation, IaC, CI/CD, model deployment workflows, runbooks, and platform governance
Lead sprint-level architectural alignment, backlog refinement, retrospectives, and post-incident reviews
Work with Product Owners and client stakeholders to shape roadmaps, define technical scope, and convert ambiguous problem statements into actionable designs
Communicate architectural decisions clearly to both technical and business audiences, balancing constraints, risks, and tradeoffs
Embed platform security, compliance, cost optimization, and data integrity into all architectural decisions
Qualifications
Required
6+ years designing and delivering cloud-native, event-driven, or distributed architectures at scale (AWS/Azure)
Deep hands-on experience with:
Kubernetes/EKS, Docker, Terraform, and cloud infrastructure patterns
Python, FastAPI, async frameworks, serverless APIs
Vector DBs (Pinecone, Elasticsearch, pgvector) and RAG/LLM integration workflows
Agentic AI frameworks (LangChain, LangGraph, AutoGen, CrewAI, LlamaIndex)
Strong knowledge of security, identity, DevSecOps pipelines, and secrets management in cloud environments
Proven leadership experience guiding engineering teams, performing code/design reviews, and enforcing architectural best practices
Excellent communication, stakeholder alignment, and documentation skills
Preferred
Experience operating LLMs/SLMs in production (NIMs, Bedrock, OpenAI, Azure OpenAI)
Experience with GPU clusters, inference optimization, or model-serving architectures (Ray, Triton, KServe)
Consulting or client-facing architecture experience
Benefits
Medical, Dental, and Vision Insurance
401(k)
Paid company holidays
Paid time off
Paid parental and caregiver leave
Plus more! See https://www.aheadbenefits.com/ for additional details.
Company
AHEAD
AHEAD is a solutions-based company that helps clients move to an optimized IT service delivery model.
H1B Sponsorship
AHEAD has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by US Department of Labor)
Trends of Total Sponsorships
2025 (24)
2024 (19)
2023 (22)
2022 (20)
2021 (18)
2020 (1)
Funding
Current Stage: Late Stage
Total Funding: $97.72M
2024-05-06 · Series Unknown · $5.7M
2024-02-15 · Series Unknown · $43.6M
2023-11-02 · Series Unknown · $5.77M
Company data provided by Crunchbase