Reliability Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

X4 Engineering ยท 2 days ago

Reliability Engineer

X4 Engineering is a technology-driven company focused on building and operating large-scale, production-critical systems. They are seeking a Reliability Engineer to ensure system reliability, resilience, and observability, working closely with engineering teams to enhance uptime and scalability in live production environments.

Staffing & Recruiting
badNo H1Bnote
Hiring Manager
Jordan Pickering
linkedin

Responsibilities

Design and maintain highly available, scalable, and performant applications and infrastructure. Implement strategies to prevent downtime and reduce failure impact
Build and operate monitoring, logging, and alerting systems (Prometheus, Grafana, GCP Monitoring, PagerDuty) to detect and resolve issues quickly
Lead production incident response and drive meaningful root-cause analysis to prevent recurrence
Partner closely with engineering teams and an existing DevOps function to enable rapid, reliable software delivery
Improve system reliability through automation, CI/CD pipelines, and infrastructure validation
Contribute to cloud infrastructure automation using Terraform or equivalent tools
Support security and compliance initiatives as they relate to system reliability and availability

Qualification

Site Reliability EngineeringInfrastructure as CodeCloud experienceLinux/Unix administrationCI/CD pipelinesContainerizationScriptingNetworking fundamentalsObservability tooling

Required

Bachelor's degree in Computer Science, Engineering, or equivalent practical experience
4+ years in a Site Reliability Engineering, Reliability Engineering, or DevOps-adjacent role
Strong Linux/Unix administration skills
Proficiency in scripting (Bash; Python, JavaScript, or SQL a plus)
Experience with Infrastructure as Code (Terraform or equivalent)
Hands-on experience with CI/CD pipelines (GitHub Actions or similar)
Containerization experience (Docker, Kubernetes)
Solid networking fundamentals for troubleshooting distributed systems
Cloud experience with GCP preferred (AWS or Azure acceptable)
Experience with observability tooling: Prometheus, Grafana, GCP Logging/Monitoring, PagerDuty, Slack

Company

X4 Engineering

twitter
company-logo
X4 Engineering partner with businesses across the entire engineering spectrum, from early-stage R&D to commercial enterprises to provide world-class talent solutions.

Funding

Current Stage
Early Stage
Company data provided by crunchbase