Lead Site Reliability Engineer – Cloud Platform (AWS) jobs in United States
cer-icon
Apply on Employer Site
company-logo

Toyota Financial Services Corporation · 5 months ago

Lead Site Reliability Engineer – Cloud Platform (AWS)

Toyota Financial Services is part of Toyota, a world-leading brand in mobility solutions. They are seeking a skilled Lead Site Reliability Engineer to enhance the reliability and automation of their AWS infrastructure, collaborating with various teams to ensure system resilience and operational efficiency.

Financial Services
badNo H1Bnote

Responsibilities

Operate and optimize cloud-native infrastructure in AWS, with a focus on EKS, Lambda, CloudWAN, Systems Manager, and ECR
Build and maintain self-healing automation workflows to reduce manual toil and improve uptime
Create and manage AWS Systems Manager (SSM) Automation Documents for operational efficiency
Define and track SLIs/SLOs and error budgets to improve system reliability
Implement observability using Dynatrace and AWS-native tools (e.g., CloudWatch)
Develop and maintain infrastructure as code using Terraform for repeatable, scalable deployments
Enhance and support CI/CD pipelines using GitHub and Harness
Participate in incident management, on-call rotations, and lead blameless postmortems
Collaborate with cloud development teams to improve architecture, delivery, and system performance
Troubleshoot cloud infrastructure and networking issues and perform root cause analysis (RCA)
Continuously identify opportunities to improve reliability, performance, and operational processes

Qualification

AWSSRE principlesTerraformCI/CD toolsPythonBashDynatraceCloudWatchNetworking knowledgeCollaboration skills

Required

7+ years of experience in SRE, DevOps, or Cloud Infrastructure roles
Solid understanding of SRE principles: SLIs, SLOs, error budgets, incident response
Hands-on experience with AWS services such as EKS, Lambda, CloudWAN, EC2, S3, RDS, Redshift, Systems Manager
Strong knowledge of network architecture and protocols within AWS
Experience building automated remediation and self-healing systems
Proficiency with Terraform, Python, Bash, and infrastructure as code principles
Experience with CI/CD tools (GitHub, Harness) and observability platforms (Dynatrace, CloudWatch)
Familiarity with ITSM processes and cloud security best practices
Excellent troubleshooting, problem-solving, and collaboration skills
Ability to work independently and within a cross-functional team environment

Preferred

Bachelor's degree in Information Technology or related field
AWS Certifications (e.g., DevOps Engineer, Solutions Architect)
Experience with integration tools like MuleSoft, Apache Camel, or message streaming platforms

Benefits

A work environment built on teamwork, flexibility, and respect
Professional growth and development programs to help advance your career, as well as tuition reimbursement
Team Member Vehicle Purchase Discount
Toyota Team Member Lease Vehicle Program (if applicable)
Comprehensive health care and wellness plans for your entire family
Toyota 401(k) Savings Plan featuring a company match, as well as an annual retirement contribution from Toyota regardless of whether you contribute
Paid holidays and paid time off
Referral services related to prenatal services, adoption, childcare, schools and more
Tax Advantaged Accounts (Health Savings Account, Health Care FSA, Dependent Care FSA)
Relocation assistance (if applicable)

Company

Toyota Financial Services Corporation

twitter
company-logo
Toyota Financial Services Corporation is made up of affiliates in more than 35 countries/locations.

Funding

Current Stage
Growth Stage

Leadership Team

leader-logo
Brajesh Kumar
Chief Technology Officer
linkedin
leader-logo
Kris Pritchard
Vice President & Chief Risk Officer
linkedin
Company data provided by crunchbase