NAPA Auto Parts · 15 hours ago
Site Reliability Engineer III
NAPA Auto Parts is looking for a Site Reliability Engineer III to enhance system reliability and resilience. This role focuses on building automation to reduce manual effort and support large-scale, distributed systems while ensuring critical platforms are reliable and able to support continuous improvement.
Automotive, Industrial, Machinery Manufacturing
Responsibilities
Gathers and analyzes metrics from monitoring platforms to assist in performance tuning and fault tolerance
Partners with development teams to improve services through testing and release procedures
Participates in system design, platform management and capacity planning
Balances feature development speed and reliability with service-level objectives
Works closely with the incident response team to restore service to normal operation
Understands debugging and applies troubleshooting skills
Investigates, blocks and rate-limits unwanted traffic
Utilizes monitoring systems and dashboards for proactive changes and alerting
Establishes continuous process improvement cycles where the process, performance, and supporting technologies are reviewed and enhanced where applicable
Performs other duties as assigned
Qualifications
Required
Typically requires a bachelor's degree and five (5) or more years of related experience, or an equivalent combination of education and experience
Understanding of Kubernetes, containers, clusters, and elastic scalability
Expertise in SRE principles
Mindset of continually finding ways to drive scalability, stability, and performance
Cloud Services experience with Google Cloud Platform (GCP)
Experience with API, service-based or microservice-based architecture
Proficiency in infrastructure, network, database, operating systems, or security troubleshooting and remediation
Architecture-level knowledge of Windows, Linux, and infrastructure systems
Experience with production deployment, monitoring, and operational support for enterprise-class applications (Dynatrace a plus)
Experience working with Continuous Integration/Continuous Deployment (CI/CD) tools
Experience in performance diagnostics, capacity planning, performance architecture design, performance tuning, and performance monitoring
A strong mix of software engineering and operational support skills
Knowledge of web technologies: HTTP, proxies, Java, etc.
Experience with Azure DevOps (ADO), Dynatrace, Prometheus, Terraform and Grafana
Benefits
Healthcare coverage
401(k)
Tuition reimbursement
Vacation
Sick leave
Holiday pay
Company
NAPA Auto Parts
Through nearly 6,000 auto parts stores and over 16,000 auto care and collision centers in the U.S., NAPA has America’s largest network of parts and care.
Funding
Current Stage
Late Stage
Company data provided by Crunchbase