Site Reliability Engineer III jobs in United States
cer-icon
Apply on Employer Site
company-logo

Apex Systems ยท 6 days ago

Site Reliability Engineer III

Apex Systems is a world-class IT services company that serves thousands of clients across the globe. They are seeking a Site Reliability Engineer III responsible for building and managing the reliability of an Internal Cloud Container Platform, monitoring performance issues, and collaborating with engineering teams to enhance operational excellence.

Human ResourcesInformation TechnologyRecruiting
check
H1B Sponsor Likelynote

Responsibilities

Responsible for building, managing reliability and support of Internal Cloud Container Platform on-prem
Monitor and troubleshoot Container platform (Openshift) Rancher and VKS (Vmware Kubernetes Service - TKG) environment performance issues, connectivity issues, security issues, etc
Perform deep dives into systemic and latent reliability issues, Incident management, problem management
Identifying, analyzing, and resolving infrastructure vulnerabilities and application deployment issues
Perform blameless RCA, partner with engineering and operation teams across the organization to roll out fixes
Responsible for application onboarding and provide troubleshooting support through the lifecycle of the applications on the container platform
Identify and drive opportunities to improve automation to reduce TOIL and improve operational excellence
Partner with risk, and compliance teams to bring visibility and implement right controls and remediation of vulnerabilities
Ensure resiliency during implementation and identify/fix resiliency problems by collaborating with engineering teams
Be a key stakeholder in the design of cloud services and work with Architecture, engineering, product teams
Participate in 24x7 on-call coverage follow the sun model

Qualification

KubernetesOpenShiftContainer securityLinux OSPythonAnsibleTerraformCI/CD toolsMonitoring toolsGolangShell scriptingRancherAgile methodologiesProblem-solvingCommunication skills

Required

BS /MS degree in Computer Science or related technical field involving systems or equivalent practical experience
Minimum 5+ years of hands-on experience supporting Kubernetes /Openshift Container platform
Experience with Python, Ansible, Golang, and shell scripting
Strong experience in major services related to Compute, Storage, Network and Security
Experience with monitoring tools like Prometheus and Dynatrace
Experience working with a complex IAM infrastructure, including Active Directory, and Ping Identity or other SSO solutions
Advanced knowledge of Linux OS, DNS, DHCP, Kerberos and Windows Authentication
Experience with CI/CD tools git /Jenkins, GitOps model
Excellent understanding of Linux /Windows operating systems administration
Experience in Container security and vulnerability remediation
Experience with OpenShift virtualization on VMware ESX, Container networking, VKS (TKG)
Systematic problem-solving approach, sense of ownership and drive
Ability to juggle competing priorities and adapt to changes in project scope
Excellent interpersonal, organizational and communication (written, verbal, and presentation) skills are a must
Proven ability to work independently with minimal supervision and as part of a team with direct responsibilities
Experience in Openshift, CSP Kubernetes services such as VKS (TKG), Rancher
Experience in Terraform, ArgoCD, Tekton, and K-native technologies
Experience in agile deployment methodologies (GitOps)
Knowledge of various containers run time
Familiarity with the operator deployment pattern
Experience working in a highly available multidata center environment
Experience working with monitoring tools such as Prometheus, Splunk, Dynatrace, Sysdig, or similar tools
Understanding cost management, inventory management

Benefits

Medical
Dental
Vision
Life
Disability
Other insurance plans
ESPP (employee stock purchase program)
401K program
HSA (Health Savings Account on the HDHP plan)
SupportLinc Employee Assistance Program (EAP) with up to 8 free counseling sessions
Corporate discount savings program
Other discounts
On-demand training program
Access to certification prep
Library of technical and leadership courses/books/seminars
Certification discounts and other perks to associations that include CompTIA and IIBA
Dedicated customer service team for our Consultants
Certified Career Coach

Company

Apex Systems

company-logo
Apex Systems, a division of On Assignment, provides organizations with IT staffing solutions to address gaps in their current workforce.

H1B Sponsorship

Apex Systems has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (28)
2024 (21)
2023 (35)
2022 (26)
2021 (29)
2020 (38)

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Roger Wahman
Chief Technology Officer - SVP
linkedin
leader-logo
Andrea Schiola
Global Head of Technology Partnerships, SVP, Principal
linkedin
Company data provided by crunchbase