NMI ยท 1 day ago
Staff DevOps Infrastructure Engineer
NMI is a company that empowers partners with innovative payment solutions. They are seeking a Staff DevOps Infrastructure Engineer to lead maintenance and operations for production environments, enhance automation, and improve observability in a mission-critical setting.
Financial Services
Responsibilities
Lead maintenance and operations for production and development environments, including patching, deployments, and server management
Ensure high reliability and performance of services, proactively resolving incidents before customer impact
Participate in 24x7 on-call rotations and drive post-incident reviews and blameless post-mortems
Architect and implement complex solutions spanning OS, virtualization, network, storage, and cloud layers
Coordinate on-site deployments in colocation facilities (server/storage installation, decommissioning, and troubleshooting)
Lead automation initiatives for infrastructure provisioning and operational tasks
Design and maintain tooling for observability using OSS and commercial platforms (Grafana, Prometheus, ELK)
Partner with product, security, and engineering teams to deliver infrastructure that meets compliance, performance, and scale requirements
Continuously improve SRE/DevOps practices, driving documentation quality, operational maturity, and agility
Qualification
Required
8+ years in DevOps, SRE, or infrastructure engineering roles
Proven experience in hybrid infrastructure with strong colocation and on-prem expertise (not exclusively public cloud)
Proficiency in configuration-as-code tooling (Ansible, Puppet) and scripting (Python, Bash, Go)
Expert-level Linux systems knowledge (RHEL-based distributions preferred)
Experience with Proxmox, KVM, or VMWare in high-availability environments
Advanced troubleshooting of SANs, load balancers, and virtualization platforms
Proactive infrastructure monitoring using commercial or OSS alerting systems
Preferred
Experience with F5 BigIP LTMs, NetApp SANs, or similar enterprise systems
Familiarity with observability stacks such as Grafana, Prometheus, ELK
Working knowledge of MySQL and Kubernetes (or motivation to learn it)
Prior exposure to SaaS-based WAF/DDoS platforms (CloudFlare, Akamai, Silverline)
Experience in agile teams (Scrum, Kanban) and fast-moving scale-up environments
Hands-on GitLab experience is a plus
Benefits
A remote first culture!
Flex PTO
Health, Dental and Vision Insurance
13 Paid Holidays
Company volunteer days