Airbyte · 1 week ago
Software Engineer, Applied AI
Airbyte is the open-source standard for data movement, enabling data teams to efficiently transfer data across various sources. The Software Engineer on the Data Replication team will design and build intelligent systems to enhance data movement and improve sync reliability using AI-driven solutions.
AnalyticsData IntegrationGenerative AIOpen SourceProductivity Tools
Responsibilities
Build AI-driven systems for data replication and connector lifecycle management, accelerating connector development, rollout, testing, and upgrades across OSS, Enterprise, and Cloud
Design and implement agentic workflows that assist with diagnosing sync failures, schema evolution issues, performance regressions, and rollout risks across large fleets of connectors
Build connectors and frameworks with AI to scale a wide range of reliable integrations
Develop observability, anomaly detection, and automated remediation systems (ML + LLM hybrid) for data sync execution, job correctness, and CDC pipelines
Improve control plane and data plane operations by automating deployment validation, release qualification, and environment testing (AWS, GCP, local, KIND)
Own AI systems across the full lifecycle: design, prompt engineering, evaluation, deployment, monitoring, and iteration in production (LLMOps)
Partner closely with platform, infra, and product teams to embed AI-powered capabilities into Airbyte’s deployment flows, APIs, and Cloud self-serve experience
Build high-leverage internal tooling that helps Airbyte ship connector and CDK changes faster while maintaining correctness, performance, and cost efficiency
Qualification
Required
5+ years of engineering experience (backend, platform, or distributed systems) with strong proficiency in Python and/or Kotlin
Hands-on experience building or operating data pipelines, replication systems, or ETL/ELT platforms
Experience designing systems that integrate LLMs with structured data, logs, APIs, or retrieval systems
Familiarity with agentic or orchestration frameworks (e.g., LangChain, Pydantic AI, Temporal-style workflows)
Experience deploying and monitoring production systems, including LLMOps, observability, and alerting
Experience running services on Kubernetes, Helm, Terraform, and major cloud providers
Strong understanding of APIs, databases, connectors, schemas, and telemetry in distributed environments
Systems-level thinking with an emphasis on performance, reliability, cost, and scalability
A startup-ready mindset: comfortable with ambiguity, moving fast, and owning problems end-to-end
A builder's instinct for automation, leverage, and developer experience
Preferred
Experience with open-source platforms, especially in data integration or infrastructure tooling
Familiarity with Airbyte, CDKs, or connector-based architectures
Exposure to large-scale connector fleets, schema evolution, CDC, or long-running sync execution
Background in control plane/data plane architectures or internal developer platforms
Company
Airbyte
Airbyte is an open-source data integration engine that helps to sync data from applications to warehouses.
H1B Sponsorship
Airbyte has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (11)
2024 (5)
2023 (5)
2022 (4)
2021 (1)
Funding
Current Stage
Growth StageTotal Funding
$181.2MKey Investors
Altimeter Capital,CoatueBenchmarkAccel
2021-12-17Series B· $150M
2021-05-25Series A· $26M
2021-03-02Seed· $5.2M
Recent News
2025-12-05
2025-11-04
Company data provided by crunchbase