Traversal · 5 days ago
AI Engineer - Data Platform
Traversal is an AI Site Reliability Engineer (SRE) for the enterprise, trusted by major companies to address complex production incidents. The role involves designing, building, and maintaining backend systems for an AI-driven observability platform, focusing on reliability and performance across cloud and on-prem deployments.
Artificial Intelligence (AI)SoftwareSoftware Engineering
Responsibilities
Contribute to the design and implementation of scalable, resilient infrastructure systems that power AI-driven root cause analysis and observability workflows across diverse on-premises environments
Work on the foundational building blocks of our infrastructure, ensuring efficient use of resources and high performance at scale
Profile and tune backend systems to improve throughput, reduce latency, and minimize bottlenecks across the stack
Help build and maintain the internal observability stack—logs, metrics, and traces—used by our agents to understand and act on production issues
Support architectures for both cloud-hosted (SaaS) and on-prem deployments to serve enterprise customers
Develop and maintain low-latency, high-throughput pipelines using tools like Kafka, Postgres, and S3 for real-time telemetry workflows
Contribute to infrastructure-as-code, CI/CD tooling, and deployment systems to increase platform velocity and stability
Work with AI, platform, and product teams to ensure smooth integration and shared reliability goals
Help ensure our own observability tooling supports how we debug, monitor, and operate our systems
Qualification
Required
Professional experience with Rust (our primary language for infrastructure), or strong systems-level programming experience in OCaml, C++, C or Zig
Experience building distributed systems using a variety of application-appropriate datastores (e.g., Postgres, object storage, etc.)
Strength in debugging across cloud infrastructure, networking layers, and production systems (instrumentation, provisioning, bug fixes, reliability improvements)
Experience with performance profiling and optimization in backend systems
Exposure to low-level system design concepts (e.g., concurrency models, storage internals, OS, and DB level tuning)
Preferred
Experience making complex software systems observable using logs, metrics, and traces
Familiarity with Python-based ecosystems
Background in large-scale, complex, data-driven applications, and familiarity with event streaming platforms such as Kafka
Experience provisioning and managing infrastructure using Terraform, Pulumi, or other IaC tools
Familiarity with AI or LLM-powered products
Benefits
Health insurance
Flexible time off
Plenty of in-office snacks
Competitive salary and equity packages
Company
Traversal
Traversal is building the AI SRE for the enterprise.
Funding
Current Stage
Early StageTotal Funding
$48MKey Investors
Sequoia CapitalKleiner Perkins
2025-06-20Seed
2025-06-18Series A· $48M
Company data provided by crunchbase