This job has closed.

Jobs via Dice · 6 hours ago

Site Reliability Engineer

Genesis10 is a leading staffing firm in the U.S., and they are seeking a Site Reliability Engineer for their client in the financial industry. The role involves ensuring the reliability and support of the Container Platform across various cloud environments and troubleshooting performance and security issues.

Computer Software

Responsibilities

Ensure the reliability and support of the Container Platform on-prem and in external clouds (Azure/AWS/Google)
Monitor and troubleshoot performance, connectivity, and security issues in the Container Platform (OpenShift), Rancher (RKE), and Azure (AKS) environments
Perform deep dives into systemic and latent reliability issues; handle incident management and problem management
Identify, analyze, and resolve infrastructure vulnerabilities and application deployment issues
Perform blameless root cause analysis (RCA) and partner with engineering and operations teams across the organization to roll out fixes

Qualifications

Kubernetes, OpenShift, Python, Azure, Ansible, Golang, Linux OS, CI/CD tools, Container security, Soft skills

Required

BS/MS degree in Computer Science or related technical field involving systems or equivalent practical experience
Minimum 5 years of hands-on experience supporting Kubernetes/OpenShift/RKE/EKS container platforms
Experience with Python, Ansible, Golang, and shell scripting
Strong experience in major services related to Compute, Storage, Network and Security
Experience with monitoring tools like Prometheus and Dynatrace, as well as cloud native tools like Azure Monitor and Log Analytics
Strong understanding of, and background working with, complex IAM infrastructure, including Active Directory, Azure AD Connect, Azure AD, and Ping Identity or other SSO solutions
Advanced knowledge of Linux OS, DNS, DHCP, Kerberos and Windows Authentication
Experience with CI/CD tools (Git, Jenkins) and the GitOps model
Excellent understanding of Linux/Windows operating systems administration
Experience in Container security and vulnerability remediation
Systematic problem-solving approach, sense of ownership and drive
Ability to juggle competing priorities and adapt to changes in project scope
Excellent interpersonal, organizational, and communication skills (written, verbal, and presentation)
Proven ability to work independently with minimal supervision and as part of a team with direct responsibilities
Red Hat OpenShift
Kubernetes
Microsoft Azure

Preferred

Experience in OpenShift, RKE, and CSP Kubernetes services such as AKS and EKS
Experience in Terraform, ArgoCD, Tekton, and Knative technologies
Experience in agile deployment methodologies (GitOps)
Knowledge of various container runtimes
Familiarity with the operator deployment pattern
Experience working in a highly available multi-datacenter environment
Experience working with monitoring tools such as Prometheus, Splunk, Dynatrace, Sysdig, or similar tools
Understanding of cost management, inventory management, and the FinOps model
Kubernetes/OpenShift/Terraform certifications

Benefits

Behavioral Health Platform
Medical, Dental, Vision
Health Savings Account
Voluntary Hospital Indemnity (Critical Illness & Accident)
Voluntary Term Life Insurance
401K
Sick Pay (for applicable states/municipalities)
Commuter Benefits (Dallas, NYC, SF and Illinois)

Company

Jobs via Dice

Welcome to Jobs via Dice, the go-to destination for discovering the tech jobs you want.

Funding

Current Stage
Early Stage
Company data provided by crunchbase