Service Reliability Engineer jobs in United States

Mambu · 2 months ago

Service Reliability Engineer

Mambu is a leading SaaS cloud banking platform on a mission to improve banking for a billion people. As a Service Reliability Engineer, you will be responsible for ensuring system reliability, managing live production incidents, and implementing observability practices to enhance the customer experience.

Banking, Financial Services, FinTech, Lending, SaaS, Software
H1B Sponsor Likely

Responsibilities

Own Live Production Incidents: You'll be the first responder for production issues impacting Mambu's mission-critical workloads. Your excellent troubleshooting skills will be crucial for investigating and resolving issues quickly, ensuring minimal disruption and fast recovery for our customers
Design and Define Observability: Design, maintain, and evolve monitoring, alerting, and logging to catch issues before customers do. You'll have a strong understanding of what "good" looks like for monitoring API performance, and you'll define and implement the standards that ensure our platform's health
Lead Incident Communication: You will be the direct point of contact for customers during critical incidents. Your exceptional communication and analytical skills will be essential for balancing technical priorities with customer impact and providing clear, confident updates
Empower and Automate: You will build and document robust knowledge bases, lead training sessions, and automate repetitive support processes with an AI-first mindset
Champion Resilience and Operational Excellence: Partner with product and infrastructure teams to embed reliability, capacity management, and operational excellence into how we build software. You will advocate for best practices like blameless post-mortems, SLO/SLI design, and incident command

Qualifications

Troubleshooting skills, Observability stacks, Public cloud services, Scripting/programming, SQL knowledge, Software delivery lifecycle, AI-first mindset, Passion for automation, Version control systems, Certification with cloud providers, Communication

Required

Exceptional troubleshooting skills and hands-on experience resolving complex production incidents in a mission-critical environment
Deep understanding of observability stacks (Prometheus, Grafana, ELK, OpsGenie, Datadog, etc.) and demonstrated experience defining and implementing alerting and monitoring setups
Excellent communication and customer-facing skills, with the ability to manage direct customer conversations during high-stakes situations
Experience with public cloud services (AWS, GCP, or Azure), distributed systems, and cloud-native applications
Proficiency in scripting/programming (Bash, Python, Go, or Java), along with software engineering skills in Java for troubleshooting and debugging production issues
An AI-first mindset and experience leveraging AI to proactively identify problems, automate support processes, and optimize workflows
SQL knowledge for querying, troubleshooting, and performance tuning
Familiarity with the software delivery lifecycle, CI/CD practices, and a DevOps culture

Preferred

Knowledge of version control systems (Git/GitHub)
Passion for automation, resilience engineering, and scaling operations
Certification with one of the cloud providers (AWS, Google Cloud or Azure)

Benefits

Company equity for all
Learning and development opportunities
Hybrid/Remote working (location dependent)
30 days of working abroad
4-week paid sabbatical after 5 years of service
Additional benefits based on location

Company

Mambu is the only true SaaS cloud core banking platform.

H1B Sponsorship

Mambu has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for your reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2025 (4)
2024 (2)
2022 (1)
2021 (5)

Funding

Current Stage: Late Stage
Total Funding: $448.32M
Key Investors: EQT, TCV, Bessemer Venture Partners
2021-12-09 · Series E · $265.4M
2021-01-07 · Series D · $134.96M
2019-02-18 · Series C · $33.93M

Leadership Team

Fernando Zandona, Chief Executive Officer
Mark Geneste, Chief Revenue Officer
Company data provided by Crunchbase