ALO · 2 hours ago
Manager, Site Reliability Engineering
ALO is a company focused on mindful movement, aiming to enhance the lives of individuals both on and off the mat. They are seeking a Site Reliability Engineering (SRE) Manager to enhance the reliability, scalability, and efficiency of their e-commerce and internal systems, collaborating with various teams to meet organizational goals.
ApparelFashionTextiles
Responsibilities
Work with Digital Technology Leadership: Spend time with each leader on the digital technology team to understand their current portfolio. Steer SRE teams to proactively identify and address issues before they occur and triage issues independently, reducing the need to escalate to engineering teams
Incident Management & Response: Own the end-to-end incident response process for our SRE Level 3 roles, from on-call preparedness to post-incident reviews. Ensure clear severity definitions, escalation paths, and timely communication during incidents to minimize downtime. Co-lead blameless post-mortems and implement process improvements for faster, more accurate incident resolution
Monitoring & Observability: Drive enhancements in monitoring and observability across all products and services. Expand meaningful alerting and dashboards using our tools (e.g., New Relic) to proactively detect issues and reduce alert noise. Continuously refine alert thresholds and ensure only high-priority alerts wake up the on-call team, routing lower-priority issues to the ticketing system. Champion the use of observability scorecards to measure coverage and address gaps
Automation & Tooling: Identify opportunities to automate repetitive tasks and reduce operational toil. Oversee integration between our incident management platform (PagerDuty) and ITSM system. Leverage infrastructure-as-code and other automation tools
Cross-Team Collaboration: Partner with software engineering and DevOps teams to ensure new features and services are production-ready. Establish production readiness checklists and work closely with QA, product, and change management teams to embed SRE practices into the SDLC
Vendor & Partner Reliability: Maintain strong relationships with critical technology vendors. Develop clear vendor support and escalation plans. Collaborate on joint drills or reviews to ensure uptime and recovery objectives are met
Reliability Strategy: Define and track reliability goals aligned with business needs. Report on SRE KPIs and continuously refine the SRE roadmap
Qualification
Required
5+ years in SRE/DevOps/Infrastructure roles, including 2+ years in leadership
Proven experience with mission-critical systems is essential
Strong knowledge of system administration, networking, and cloud infrastructure (e.g., AWS)
Hands-on experience with New Relic, PagerDuty, Freshservice, logging, APM, and tracing tools is required
Significant experience in engineering, with a deep understanding of software development, system architecture, and infrastructure management
Skilled in scripting (Python, Shell) and infrastructure-as-code (Terraform)
Ability to build CI/CD and self-healing mechanisms
Deep understanding of incident response, ITIL/ITSM, and root cause analysis
Experience with alert tuning and communication plans is necessary
Strong management and communication skills
Ability to align cross-functional teams and translate reliability issues into business terms
Committed to continuous improvement and learning
Ability to assess SRE maturity and drive iterative improvements
Benefits
Performance bonuses
Long term incentives
PTO policy
Many other progressive benefits
Company
ALO
We are ALO.
H1B Sponsorship
ALO has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (15)
2024 (4)
2023 (4)
2022 (4)
Funding
Current Stage
Late StageLeadership Team
Recent News
Company data provided by crunchbase