Senior Software Engineer, Infrastructure jobs in United States

Serval · 15 hours ago

Senior Software Engineer, Infrastructure

Serval is an AI platform for IT teams that aims to automate complex IT workflows for modern enterprises. As a Senior Software Engineer in Infrastructure, you will build and scale foundational systems that support Serval's AI agents and workflow automation platform, focusing on self-hosted deployments for enterprise customers.

Agentic AI · Artificial Intelligence (AI) · Information Technology · Natural Language Processing

Responsibilities

Design, implement, and operate large-scale distributed systems that power Serval's AI agents, workflow orchestration, and data pipelines
Write and maintain Terraform modules to provision and manage cloud infrastructure across AWS, GCP, or Azure environments
Build and maintain deployment packages, installation scripts, and infrastructure templates that enable customers to self-host Serval in their own environments
Provide technical guidance and troubleshooting support to enterprise customers deploying and operating self-hosted instances of Serval
Ensure high availability, performance, and reliability of production systems through monitoring, alerting, incident response, and capacity planning
Build internal tools and platforms that enable product engineers to deploy, test, and operate services efficiently
Collaborate with engineering teams to design resilient, scalable architectures that support both cloud-hosted and self-hosted deployment models
Profile and optimize system performance, including compute, storage, networking, and database layers
Implement security best practices and ensure infrastructure meets enterprise compliance requirements for both managed and self-hosted deployments

Qualifications

Distributed systems · Terraform · Cloud provider expertise · Python · Go · Networking knowledge · Containerization · Monitoring tools · Debugging skills · Technical communication

Required

3+ years building and operating large-scale distributed systems in production environments
Strong experience writing and maintaining Terraform for infrastructure provisioning and management
Deep knowledge of at least one major cloud provider (AWS, GCP, or Azure), including compute, networking, storage, and managed services
Experience building, packaging, and supporting self-hosted or on-premises software deployments for enterprise customers
Proficiency in Python, Go, or similar languages for building automation, tooling, and infrastructure services
Strong understanding of networking, databases, containerization (Docker, Kubernetes), and orchestration systems
Experience with monitoring, logging, alerting, and incident management tools (e.g., Datadog, Prometheus, Grafana, PagerDuty)
Ability to communicate technical concepts clearly to customers and provide infrastructure support and guidance
Ability to debug complex system issues, analyze performance bottlenecks, and implement effective solutions

Preferred

Experience with Kubernetes in production, including cluster management and workload orchestration
Background in CI/CD systems, build pipelines, and deployment automation
Experience with workflow orchestration systems such as Temporal, including long-running workflows, retries, and failure handling
Experience with data infrastructure (streaming systems like Kafka, data warehouses, ETL pipelines)
Knowledge of security and compliance frameworks (SOC 2, ISO 27001, GDPR)
Experience supporting enterprise customers with complex deployment requirements
Previous work at a high-growth startup or experience scaling infrastructure rapidly

Benefits

Impact: Be a key player in shaping the success of our product and company.
Growth: Build a fundamentally new AI product offering with the support of our experienced team and investors. Grow rapidly with the company.
Culture: Join a culture that values innovation, ownership, accountability, and fun.

Company

Serval

Serval provides an AI-native IT service management platform that automates routine tasks and streamlines help desk requests.

Funding

Current Stage: Growth Stage
Total Funding: $122M
Key Investors: Sequoia Capital, Redpoint
2025-12-11 · Series B · $75M
2025-10-21 · Series A · $47M

Leadership Team

Jake Stauch
Founder and CEO
Company data provided by Crunchbase