Smiley Technologies, Inc. · 6 hours ago
Site Reliability Engineer
Smiley Technologies, Inc. powers core banking platforms used by banks and credit unions across the United States. They are hiring a Site Reliability Engineer to enhance reliability practices, incident response, and observability, while working closely with various technical teams.
AppsInformation TechnologyIT Infrastructure
Responsibilities
Work cross-functionally with Network, SecOps, DevSecOps, Platform, Developers, and Support
Own and evolve observability and monitoring, primarily using Dynatrace
Dashboards, alerts, reporting, and adoption across teams
Help teams improve root cause analysis, retrospectives, and reliability practices
Support and improve CI/CD pipelines (GitHub Actions / Azure DevOps)
Maintain standards and documentation across the SDLC
Monitor and optimize cloud costs, infrastructure tiers, and capacity
Participate in Incident Command on-call rotation
Define and document SLIs, SLOs, SLAs, KPIs, and OKRs
Promote both Shift Left and Shift Right reliability thinking
Help Smiley continue its DevOps and Platform transformation
Qualification
Required
2+ years in an SRE, Platform Engineer, or DevOps role
Hands-on experience with APM / observability tools
Dynatrace strongly preferred
Experience with Azure or AWS
Experience supporting CI/CD pipelines
Experience with containers (Docker, AKS)
Working knowledge of Git, Terraform, Helm, Bash, PowerShell
Experience supporting REST APIs
Experience with .NET or Python (or similar)
Preferred
Linux/Unix administration fundamentals
Performance troubleshooting across applications
Performance troubleshooting across databases (SQL Server, DB2, PostgreSQL)
Familiarity with WAFs, networking, and OWASP concepts
Experience with developer portals (Backstage.io a plus)
Financial services or regulated environments (helpful, not required)