SIGN IN
Manager of Site Reliability Engineering jobs in United States
cer-icon
Apply on Employer Site
company-logo

UKG · 14 hours ago

Manager of Site Reliability Engineering

UKG is a company that focuses on workforce management solutions, and they are seeking a Manager of Site Reliability Engineering. This role is responsible for application reliability, performance, and operability, leading teams to develop solutions that enhance service delivery and support Cloud Engineering and Infrastructure.
Bookkeeping and PayrollHuman ResourcesSoftware
check
Comp. & Benefits
check
H1B Sponsor Likelynote

Responsibilities

Be a Technology Leader by driving the roadmap execution and running the project(s) while planning new ones
Help drive change across the company, working towards a common methodology based around Site Reliability Engineering and Solid System Engineering practices
Lead the team in driving further adoption of Site Reliability practices such as Chaos engineering, SLOs, Error Budgets, release safety, load testing, and disaster recovery strategies
Build teams through hiring and people growth while balancing your ownership workload through delegation and define and review individual and team goals (OKRs)
Responsible for guiding and encouraging the personal and technical development, engagement, and growth of your direct reports
Own application performance, scalability, and availability in production environments
Diagnose and resolve systemic reliability issues across application, OS, and infrastructure layers
Lead major incident response and act as the escalation point for platform-related reliability issues
Ensure post-incident reviews result in measurable improvements to platform stability and application performance
Partner with application teams to influence design decisions that impact runtime reliability
Collaborate cross organization to successfully complete successful delivery with the wider functions, including but not limited to Security, Architecture, Operations and Product Managers

Qualification

Public Cloud applicationsContainerization TechnologiesPerformance tuning.NET runtime behaviorPeople managementMetric generationLog aggregationDistributed tracingChaos engineeringSLOsError BudgetsLoad testingDisaster recoveryGCP Cloud environmentHiring SRE teams

Required

Engineering degree, or a related technical discipline, or equivalent work experience
Knowledge of Public Cloud based applications & Containerization Technologies
Demonstrated understanding of best practices in metric generation and collection, log aggregation pipelines, time-series databases, and distributed tracing
Experience transforming teams and successfully leading them through change
5+ year of people management experience leading a technical team
Deep understanding of Windows Server internals (memory management, threading, I/O, services)
Experience with .NET runtime behavior (GC, memory leaks, thread pools, IIS)
Performance tuning of monolithic .NET applications in production environments

Preferred

Experience working in a GCP Cloud environment
Experience with hiring SRE, DevOps, or similar engineering team

Benefits

Performance-based bonus plan
Restricted stock unit awards

Company

UKG™delivers HCM, payroll, HR service delivery, and workforce management solutions. It is a sub-organization of Hellman & Friedman.

H1B Sponsorship

UKG has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (37)
2024 (25)
2023 (33)
2022 (48)
2021 (36)

Funding

Current Stage
Late Stage
Total Funding
unknown
2007-03-23Acquired

Leadership Team

leader-logo
Jennifer Morgan
Chief Executive Officer
linkedin
leader-logo
Jim Joudrey
EVP and CTO
linkedin
Company data provided by crunchbase