Scalence L.L.C. · 1 month ago
Network/Operations - Dev Ops Engineer
Scalence L.L.C. is seeking a Senior DevOps / Site Reliability Engineer for its enterprise-scale API Gateway platform team. The role supports a critical application that handles billions of transactions daily and requires expertise in AWS, CI/CD, Python, and infrastructure troubleshooting.
Information Technology & Services
Responsibilities
Work as part of a ~30-member platform team supporting an enterprise API Gateway
Participate in weekly on-call rotations:
Primary and Secondary on-call roles
Rotation runs Thursday to Wednesday
Support production releases, including:
Weekly paper releases
Bi-weekly night releases (typically 9 PM – 12 AM)
Troubleshoot high-priority incidents involving:
API traffic issues
Network and certificate failures
Infrastructure or database latency
Collaborate closely with engineers, IT support, and leadership
Attend daily standups (10 AM) and operational calls as required
Act as a first responder for gateway-related incidents impacting internal and external customers
Qualifications
Required
Strong hands-on experience with AWS (ECS, EKS, NLBs, Aurora, CloudWatch, large-scale infra)
Python development experience (debugging, scripting, feature fixes)
CI/CD pipeline experience (GitHub, Jenkins, GitLab, or similar tools)
Strong production support and troubleshooting skills across infrastructure, networking, certificates/TLS, and databases
Experience supporting high-throughput, mission-critical systems
Ability to debug issues end-to-end with low MTTD (Mean Time to Detect)
Willingness and ability to work onsite in Richmond from Day 1
Must not require visa sponsorship
Preferred
Prior Capital One experience
Banking or financial services domain experience
Experience with Kong API Gateway (or other API gateway platforms)
SRE background supporting large enterprise platforms
Experience handling large-scale traffic (hundreds of TPS or more)
Company
Scalence L.L.C.
In today’s dynamic and competitive market, success hinges on mastering three key areas: Data Intelligence, Business Resilience, and Digital Experience.
Funding
Current Stage
Late Stage
Company data provided by Crunchbase