Mambu · 2 months ago
Service Reliability Engineer
Mambu is a leading SaaS cloud banking platform on a mission to improve banking for a billion people. As a Service Reliability Engineer, you will be responsible for ensuring system reliability, managing live production incidents, and implementing observability practices to enhance the customer experience.
BankingFinancial ServicesFinTechLendingSaaSSoftware
Responsibilities
Own Live Production Incidents: You'll be the first responder for production issues impacting Mambu's mission-critical workloads. Your excellent troubleshooting skills will be crucial for investigating and resolving issues quickly, ensuring minimal disruption and fast recovery for our customers
Design and Define Observability: Design, maintain, and evolve monitoring, alerting, and logging to catch issues before customers do. You'll have a strong understanding of what "good" looks like for monitoring API performance, and you'll define and implement the standards that ensure our platform's health
Lead Incident Communication: You will be the direct point of contact for customers during critical incidents. Your exceptional communication and analytical skills will be essential for balancing technical priorities with customer impact and providing clear, confident updates
Empower and Automate: You will build and document robust knowledge bases, lead training sessions and automate repetitive support processes with an AI-first mindset
Champion Resilience and Operational Excellence: Partner with product and infrastructure teams to embed reliability, capacity management, and operational excellence into how we build software. You will advocate for best practices like blameless post-mortems, SLO/SLI design, and incident command
Qualification
Required
Exceptional troubleshooting skills and hands-on experience resolving complex production incidents in a mission-critical environment
Deep understanding of observability stacks (Prometheus, Grafana, ELK, OpsGenie, Datadog, etc.) and demonstrated experience defining and implementing alerting and monitoring setups
Excellent communication and customer-facing skills, with the ability to manage direct customer conversations during high-stakes situations
Experience with public cloud services (AWS, GCP, or Azure), distributed systems, and cloud-native applications
Proficiency in scripting/programming (Bash, Python, Go, or Java), along with software engineering skills in Java for troubleshooting and debugging production issues
An AI-first mindset and experience leveraging AI to proactively identify problems, automate support processes, and optimize workflows
SQL knowledge for querying, troubleshooting, and performance tuning
Familiarity with the software delivery lifecycle, CI/CD practices, and a DevOps culture
Preferred
Knowledge of version control systems (Git/GitHub)
Passion for automation, resilience engineering, and scaling operations
Certification with one of the cloud providers (AWS, Google Cloud or Azure)
Benefits
Company equity for all
Learning and development opportunities
Hybrid/Remote working (location dependant)
30 day working abroad
4 week paid sabbatical after 5 years service
Additional benefits based on location
Company
Mambu
Mambu is the only true SaaS cloud core banking platform.
H1B Sponsorship
Mambu has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (4)
2024 (2)
2022 (1)
2021 (5)
Funding
Current Stage
Late StageTotal Funding
$448.32MKey Investors
EQTTCVBessemer Venture Partners
2021-12-09Series E· $265.4M
2021-01-07Series D· $134.96M
2019-02-18Series C· $33.93M
Recent News
Fintech Hong Kong
2026-01-14
FF News | Fintech Finance
2025-12-03
2025-11-04
Company data provided by crunchbase