SAP Taulia · 9 hours ago
Director of Cloud Infrastructure
SAP Taulia is a fintech company seeking a Director of Cloud Infrastructure to lead their global Cloud Operations function. This role is responsible for ensuring the reliability, availability, and performance of production services across multiple cloud data centers while leading a team of operations engineers and collaborating with cross-functional departments.
Financial Services
Responsibilities
Manage production operations across three global GCP data centers, prioritizing 24/7 availability, scalability, and resilience
Evolve operational standards, including runbooks, escalation paths, and operational readiness reviews (ORR)
Collaborate with Engineering to define SLOs/SLAs, manage error budgets, and oversee capacity and disaster recovery planning
Lead release planning and change control; partner with Engineering on progressive delivery and automated rollbacks
Enforce go/no-go frameworks and ensure all changes meet SAP guidelines and PCI DSS standards
Drive blameless post-mortems to improve change success rates and system stability
Lead major incident response (IR) and establish clear command structures for rapid resolution
Ensure corrective actions are prioritized and executed to prevent systemic recurrence
Track and report on reliability trends, MTTR, and long-term remediation progress for executive leadership
Define the strategy for logging, tracing, and monitoring to ensure rapid issue detection
Drive automation across provisioning and routine tasks to eliminate manual intervention and human error
Utilize operational KPIs (MTTR, change failure rate, toil metrics) to guide process improvements
Maintain audit readiness for PCI DSS and SAP infrastructure guidelines; manage access controls and evidence collection
Implement FinOps practices to monitor and optimize cloud spend, focusing on right-sizing and waste reduction without sacrificing performance
Oversee key partnerships for infrastructure, monitoring, and CI/CD tooling
Lead and coach a global team of ~20 engineers, defining career paths and performance expectations
Foster an inclusive, high-performing environment with strong cross-functional rhythms across Security, Product, and Architecture
Qualification
Required
10+ years in an operations leadership role (e.g., Cloud Operations, SRE/Production Engineering, Infrastructure Operations, NOC leadership), including responsibility for production availability and incident response, along with prior experience as an operations engineer and/or development engineer
Experience leading globally distributed teams supporting 24x7 operations and on-call programs
Proven track record owning operational outcomes for SaaS/cloud platforms, including incident management, change/release processes, and service reliability
Strong experience with cloud infrastructure and operations (Google Cloud Platform preferred), including multi-region architecture and disaster recovery patterns
Demonstrated experience operating in regulated or compliance-driven environments, including PCI DSS or similar control frameworks
Demonstrated ability to partner effectively across Engineering, Security, and Product to balance delivery speed with operational risk and compliance requirements
Experience implementing operational processes and governance (e.g., ITIL-inspired incident/problem/change management) in a pragmatic way
Strong communication skills with the ability to lead under pressure, align stakeholders, and provide executive-ready reporting
Preferred
Production operations leadership: incident command, escalation management, service ownership, operational readiness
Release management and change governance: risk management, dependency coordination, deployment controls
Observability strategy: monitoring/alerting, logging, tracing, SLOs, dashboards, alert quality improvements
Reliability engineering practices: automation, reducing toil, post-incident learning, resilience testing, DR planning
Cloud fluency (GCP preferred): infrastructure, networking fundamentals, security considerations, capacity/performance planning
Compliance and controls mindset: audit readiness, evidence-driven operations, PCI DSS-aligned operational practices
Cost management / FinOps: cost visibility, forecasting, optimization, and governance for cloud environments
Metrics-driven management: MTTR, availability, change failure rate, deployment frequency, operational KPI design
People leadership: hiring, coaching, performance management, org design, operating cadence
Benefits
Flexible work schedule
Remote-friendly environment
Comprehensive Insurance Coverage (Medical, Dental, Vision, Life)
Comprehensive PTO Structure (PTO, Sick Leave, Bereavement)
Global Parental Leave
Company issued equipment (Laptop, monitor, etc.)
401k with match
Career Development/Pathing
EAP Program/Mental Health Advocacy
Supportive Work Culture
Company
SAP Taulia
SAP Taulia, a global fintech leader, delivers AI-powered working capital solutions that unlock liquidity and drive supply chain resilience.
Funding
Current Stage
Late StageCompany data provided by crunchbase