Jobs via Dice · 7 hours ago
Site Reliability Engineer (SRE)
Dice is the leading career destination for tech experts at every stage of their careers. Our client, EVEREST CONSULTING GROUP, INC, is seeking a highly skilled Site Reliability Engineer who will be responsible for maintaining and improving the reliability, performance, and availability of software systems.
Computer Software
Responsibilities
Creating and supporting automation scripts (shell/ansible/python) for infrastructure deployments, validations, and monitoring to improve operational tasks
Scheduling monitoring scripts using cron and airflow
Monitoring using tools including Dynatrace, Apica, Grafana etc
Database handling
Build CICD pipelines
Incident handling and problem management
Qualification
Required
Experience in Ansible/Python
Monitoring Tools – Dynatrace/Apica/Grafana
Bachelor's degree in computer science or a related field
14 plus years of IT Infrastructure experience
Extensive experience working with Linux flavors like RHEL/CentOS OS, shells, filesystems, and utilities
Experience in programming languages like Python, Ansible
Knowledge of distributed computing and experience working with container orchestration frameworks including on-prem and Rancher Kubernetes with good knowledge of Kubernetes objects
Experience working with Storage, ONTAP is preferable: volume, aggregates, backups, DR planning
Experience scheduling monitoring scripts using cron and airflow
Experience with monitoring tools including Dynatrace, Apica, Grafana etc
Database knowledge including SQL and NoSQL DBs
Cloud platform knowledge (specifically AWS) is required
Preferred
Experience building CICD pipelines (preferred)
Company
Jobs via Dice
Welcome to Jobs via Dice, the go-to destination for discovering the tech jobs you want.
Funding
Current Stage
Early StageCompany data provided by crunchbase