Releady · 2 months ago
SRE Engineer (Remote - West Coast, PST Hours)
Releady is a company that focuses on providing observability solutions for clients, including a Washington-based airline. The SRE Engineer role involves ensuring the observability stack operates smoothly, building dashboards, integrating data sources, and responding to various requests to maintain system visibility.
ConsultingRecruitingStaffing Agency
Responsibilities
Build Grafana dashboards for teams and leadership
Create and refine alert rules, logs, and basic SLO configurations
Maintain and tune integrations between observability tools
Respond to incoming requests (new dashboards/panels, data source connections)
Onboard new applications into AppDynamics or similar tools
Develop small automation tasks with Ansible Automation Platform to reduce manual work
Build and maintain Grafana dashboards for internal teams and leadership
Operate and tune observability tools; triage and fulfill incoming requests
Connect and manage data sources across Grafana, Sumo Logic, AppDynamics, New Relic, etc
Help teams implement alerting, logging structure, and basic SLOs
Instrument new applications for monitoring and assist with onboarding
Operationalize tools inherited from networking, CSA Mobile, and SDP
Create repeatable templates and onboarding patterns for teams
Develop playbooks and small automation tasks using Ansible Automation Platform
Qualification
Required
2+ years with observability/monitoring tools (Grafana, Datadog, Sumo Logic, AppDynamics, New Relic, Kibana, etc.)
1+ year with automation tools (Ansible, Ansible Automation Platform, Ansible Tower, etc.)
2–4 years in systems/cloud/DevOps/platform/SRE-adjacent roles
Strong understanding of telemetry: metrics, logs, and traces
Reliable execution: ability to follow structured tasks and deliver consistently
Grafana (hands‑on): build dashboards, connect data sources, manage panels, and work with logs, metrics, and traces
Ansible Automation Platform (or Ansible/Tower): run playbooks, automate configuration tasks, and support infrastructure automation
Observability tooling: experience with one or more of Sumo Logic, AppDynamics, New Relic, ThousandEyes, or similar monitoring platforms
Preferred
Experience with ThousandEyes
Experience with Kubernetes or container-based platforms
Experience using AppDynamics or New Relic for application performance monitoring
Basic scripting skills
Familiarity with large enterprise operational environments