OneMain Financial · 1 month ago
Site Reliability Engineering (SRE) Team Lead
OneMain Financial is the leader in offering nonprime customers responsible access to credit. They are seeking a highly skilled Site Reliability Engineering Team Lead to guide their SRE team, ensuring operational excellence and reliability across their infrastructure while mentoring team members and driving strategic initiatives.
CreditFinanceFinancial ServicesInsuranceWealth Management
Responsibilities
Lead, mentor, and grow a team of site reliability engineers, promoting a culture of reliability, automation, and continuous improvement
Drive the design, implementation, and maintenance of scalable and fault-tolerant infrastructure to support high-availability services
Oversee incident management processes, including triage, root cause analysis, and postmortems to improve system reliability and prevent recurrence
Collaborate cross-functionally with software engineering, product, and operations teams to integrate reliability best practices into the software development lifecycle
Define and implement operational metrics, SLIs/SLOs, and dashboards to monitor system health and drive proactive improvements
Manage and assess the observability of critical environments proactively addressing gaps that may arise
Oversee the release management processes, artifacts and tools that drive a repeatable software delivery lifecycle
Champion automation efforts to reduce manual intervention, improve deployment pipelines, and optimize infrastructure management
Lead capacity planning, disaster recovery, and performance tuning efforts
Ensure security and compliance standards are upheld across infrastructure and operations
Qualification
Required
BA/BS in Computer Science, Engineering, related field, or equivalent experience
7+ years of experience in site reliability engineering, systems engineering, or related roles, with at least 2 years in a leadership position
Proven experience leading and scaling high-performing engineering teams
Deep expertise in cloud platforms (AWS, GCP, Azure) and container orchestration (Kubernetes, Docker)
Strong skills in infrastructure as code tools (Terraform, Ansible, CloudFormation) and CI/CD pipelines
Proficiency with monitoring and alerting systems (Prometheus, Grafana, ELK, Datadog)
Solid programming and scripting skills (Python, Go, Bash, or similar)
Strong understanding of distributed systems, networking, security, and databases
Excellent leadership, communication, and collaboration skills
Experience managing incident response and on-call rotations
Preferred
Experience working with microservices and event-driven architectures
Familiarity with compliance frameworks such as GDPR, PCI, SOX, or SOC 2
Background in DevOps practices and tooling
Benefits
Health and wellbeing options including medical, prescription, dental, vision, hearing, accident, hospital indemnity, and life insurances
Up to 4% matching 401(k)
Employee Stock Purchase Plan (10% share discount)
Tuition reimbursement
Paid time off (15 days’ vacation per year, plus 2 personal days, prorated based on start date)
Paid sick leave as determined by state or local ordinance, prorated based on start date
Paid holidays (7 days per year, based on start date)
Paid volunteer time (3 days per year, prorated based on start date)
Company
OneMain Financial
OneMain Financial has been offering responsible and transparent loans for over 100 years.
H1B Sponsorship
OneMain Financial has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2021 (1)
Funding
Current Stage
Public CompanyTotal Funding
$2.9B2025-03-13Post Ipo Debt· $600M
2024-11-04Post Ipo Debt· $900M
2018-01-04Post Ipo Secondary· $1.4B
Recent News
2025-11-08
2025-10-29
Company data provided by crunchbase