AI Engineer - Site Reliability Researcher jobs in United States

Traversal · 2 months ago

AI Engineer - Site Reliability Researcher

Traversal is the AI Site Reliability Engineer for the enterprise, trusted by major companies to manage complex production incidents. The role involves troubleshooting, collaborating with engineering teams, and establishing SRE research practices to enhance customer satisfaction.

Artificial Intelligence (AI), Software, Software Engineering

Responsibilities

Troubleshooting Disparate Systems: Our customers use a wide variety of platforms, so flexibility and curiosity are critical
External Interface: Gather requirements from new customers, guide them through on-boarding and maintain positive relationships to ensure their success
Internal Collaboration: Partner with engineering, AI, and product teams, passing along what you learn from end-users, as well as your own input
Evaluation and Analysis: Use your troubleshooting and customer RCAs to evaluate Traversal's performance and find ways to improve it further
Incident Management: Lead and advance our internal on-call and incident response processes, including alerting, debugging, and postmortems

Qualifications

Site Reliability Engineering, Debugging distributed systems, Observability tools, Cloud environments, Infrastructure as Code, Networking knowledge, Customer relationship management, Collaboration, Problem-solving

Required

5+ years of experience as an SRE, infrastructure engineer, or similar role in fast-paced environments
Innate ability to debug distributed systems (e.g., bare metal, VMs, Kubernetes, Docker, containers), understand how you did it, and explain it to others
Expertise with observability and metrics tools (Datadog, Elasticsearch, Grafana, OpenTelemetry, Prometheus, ServiceNow, Splunk, etc.) and incident response
Understanding of networking, including routers, switches, firewalls, VPNs, etc.
Hands-on experience with cloud environments (AWS, Azure, DigitalOcean, GCP) and Infrastructure as Code tools such as Helm and Terraform
Experience supporting cloud, on-prem, and hybrid deployments

Preferred

Background in developer productivity tooling or internal platform teams
Prior experience building systems that connect infra events to developer workflows
Exposure to agentic systems or AI observability platforms

Benefits

Health insurance
Startup equity
Flexible time off
Plenty of in-office snacks

Company

Traversal

Traversal is building the AI SRE for the enterprise.

Funding

Current Stage
Early Stage
Total Funding
$48M
Key Investors
Sequoia Capital, Kleiner Perkins
2025-06-20: Seed
2025-06-18: Series A · $48M
Company data provided by crunchbase