GAIA · 12 hours ago
Senior Software Engineer, AI Agents (Autonomous Systems) — Gaia
Gaia is building the next generation of Gaia.com experiences using AI. This role focuses on designing and shipping agentic, autonomous software systems that can plan, act, evaluate outcomes, and continuously improve—driving real product impact, not demos.
Health CareHospitalManagement ConsultingTherapeutics
Responsibilities
Architect and implement agentic AI systems that autonomously execute multi-step workflows (planning, tool use, memory, evaluation, refinement)
Build and own production services in Python that orchestrate LLM-based reasoning, retrieval, tool calling, and safe action execution
Design autonomy loops: task decomposition, reflection/self-critique, reward signals, evaluation harnesses, and guardrails
Develop robust RAG pipelines for Gaia’s content ecosystem (semantic search, chunking, embeddings, reranking, citations, freshness)
Create frameworks for agent reliability: testing, simulation, regression suites, red-teaming, and continuous evaluation
Implement observability for LLM systems: tracing, cost/latency monitoring, failure taxonomy, quality metrics, and incident response
Partner with product, design, and content teams to translate Gaia’s mission and user needs into autonomous capabilities
Optimize for performance and cost: caching, batching, model routing, quantization (where relevant), and prompt/system improvements
Ship continuously: build, measure, learn—tight loops, pragmatic decisions, and visible progress
Qualification
Required
Expert-level Python and experience building production services (APIs, workers, pipelines, orchestration)
Deep knowledge of LLMs and agentic systems, including strengths/limits, failure modes, and practical patterns for reliability
Proven track record of execution: you ship, you iterate, you improve outcomes based on real signals
Strong “builder + owner” mindset: you take ambiguous problems, create clarity, and deliver results
Entrepreneurial mindset: bias toward action, comfort with uncertainty, high accountability, and strong product instincts
Solid foundation in mathematics, statistics, and data reasoning (you can quantify uncertainty, validate improvements, and avoid hand-wavy conclusions)
Strong data fluency: instrumentation, metrics design, experiment analysis, and operational decision-making using data
Preferred
Hands-on experience building agentic workflows using modern frameworks (e.g., LangGraph/LangChain, LlamaIndex, Semantic Kernel, or equivalent custom stacks)
Experience with tool-using agents: function calling, structured outputs, constrained decoding, and robust schema validation
Experience with evaluation techniques for LLM systems (golden sets, model-graded evals, pairwise ranking, offline/online correlation)
Experience with retrieval systems: vector databases, hybrid search, reranking, query rewriting, and content freshness strategies
Knowledge of prompt/system design for production (instruction hierarchies, routing, safety constraints, and jailbreak resistance)
Experience with distributed systems and async execution patterns (queues, orchestration, retries, idempotency, backpressure)
Experience deploying and scaling LLM-enabled services in cloud environments (AWS/GCP/Azure), including CI/CD and IaC
Familiarity with MLOps/LLMOps tooling: experiment tracking, model gateways, prompt/version management, and tracing
Experience with privacy/security considerations for AI systems (PII handling, data minimization, auditability)
Front-end or full-stack capability is a plus (you can ship end-user impact, not just back-end components)
Prior work in consumer subscription products, content platforms, personalization, or discovery systems
Benefits
On-site gym
Beautiful solar-powered campus, complete with hiking and running trails, community garden, and a labyrinth
On-site, mostly organic café that serves breakfast and lunch daily including a full-service espresso bar featuring locally roasted coffee
Alternative and traditional medical benefits including preventative coverage
Dental
Vision
401K
Life insurance
Company
GAIA
GAIA develops clinically-validated digital therapeutics that empower patients, payers, and physicians with effective, FDA-cleared products.
H1B Sponsorship
GAIA has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2024 (1)
Funding
Current Stage
Growth StageCompany data provided by crunchbase