Founding Software Engineer, Data Infrastructure jobs in United States
cer-icon
Apply on Employer Site
company-logo

Airweave (YC X25) · 1 month ago

Founding Software Engineer, Data Infrastructure

Airweave is a company focused on building scalable data infrastructure for AI agents. They are seeking a founding engineer to own the data and infrastructure layer, ensuring reliable data flows and optimizing LLM inference under production loads.

Artificial Intelligence (AI)DatabaseSoftware

Responsibilities

Design and scale distributed data pipelines that sync hundreds of millions of documents from dozens sources into advanced search indexes
Build and improve Temporal workflows for parallel sync orchestration: retries, backpressure, and failure recovery across workers
Own our Kubernetes deployments with Helm charts: autoscaling, and resource management for bursty search, sync and LLM workloads
Scale PostgreSQL for high-throughput; connection pooling, read replicas, partitioning (we ask a lot from this database)
Manage vector database (Vespa) infrastructure: sharding, replication, backup strategies for large-scale agentic search
Orchestrate and optimize LLM inference pipelines: batching, caching, provider failover
Build monitoring and alerting with Prometheus, Grafana, and custom instrumentation for cluster health
Infrastructure as code for the base with Terraform

Qualification

Distributed data pipelinesKubernetesPostgreSQLInfrastructure as codeVector databasesLLM infrastructureMonitoringAlertingSoft skills

Required

You've built or operated data pipelines at scale: ETL, event processing, streaming, or sync infrastructure
You're comfortable with Kubernetes, Terraform, and infrastructure as code
You've scaled databases and understand the tradeoffs (pooling, replication, sharding)
You have experience with distributed systems: workflow orchestration, message queues, eventual consistency
You're interested in LLM infrastructure: embeddings, vector search, inference optimization
You like building reliable systems and have opinions about observability
You're drawn to early-stage environments where you own the whole problem

Preferred

Experience with Temporal, Airflow, or similar workflow engines
Background in scaling search (Elastic, Qdrant, Pinecone, Weaviate)
Familiarity with LLM inference

Benefits

Health, dental, and vision coverage
Work in-person in San Francisco with a highly-skilled, technical team
Direct impact on architecture and infrastructure decisions from the first week

Company

Airweave (YC X25)

twittertwittertwitter
company-logo
Airweave is an open-source tool that helps agent developers turn app data into accessible knowledge for AI agents.

Funding

Current Stage
Early Stage
Total Funding
$6M
Key Investors
FundersClub
2025-07-02Seed· $6M
Company data provided by crunchbase