Site Reliability Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Programming.com · 18 hours ago

Site Reliability Engineer

Programming.com is seeking a Senior Site Reliability Engineer (SRE) with expertise in AWS and Kubernetes. The role involves designing and operating highly available systems for banking and payments, leading SRE practices, and supporting Java microservices on Kubernetes.

ConsultingInformation ServicesInformation TechnologySoftware
badNo H1Bnote
Hiring Manager
Vishal Puri
linkedin

Responsibilities

Design and operate highly available, fault-tolerant systems for banking, payments, and trading platforms
Lead SRE practices: SLIs, SLOs, error budgets, RCA, post-incident remediation, L3/L4 on-call support
Support Java microservices on Kubernetes (EKS); optimize performance, scalability, and latency
Strong AWS experience: EC2, EKS, IAM, VPC, RDS, DynamoDB, S3, CloudWatch
Infrastructure automation using Terraform; scripting with Python, Go, Bash
Kubernetes networking, storage, and service mesh: Istio, Anthos Service Mesh, Portworx, multi-cluster/federation
CI/CD with GitLab CI/CD, Jenkins; zero-downtime deployments and DR strategies
Observability using Prometheus, Datadog, Splunk, Kiali, eBPF for deep system visibility
Real-time streaming: Kafka, KSQLDB, Kafka Streams, Spark Streaming
Security & compliance: IAM, secrets management, SOC2, PCI-DSS, SOX, banking-grade controls
Strong Linux/Unix, Docker, VMware, networking tools (Nginx Controller, Seesaw)
Experience with high-frequency transaction systems and regulated environments

Qualification

AWSKubernetesTerraformJava microservicesCI/CDSecurity & complianceLinux/UnixObservabilityPythonGoBashDockerVMwareNetworking toolsKafkaHigh-frequency transaction systemsCertifications

Required

Design and operate highly available, fault-tolerant systems for banking, payments, and trading platforms
Lead SRE practices: SLIs, SLOs, error budgets, RCA, post-incident remediation, L3/L4 on-call support
Support Java microservices on Kubernetes (EKS); optimize performance, scalability, and latency
Strong AWS experience: EC2, EKS, IAM, VPC, RDS, DynamoDB, S3, CloudWatch
Infrastructure automation using Terraform; scripting with Python, Go, Bash
Kubernetes networking, storage, and service mesh: Istio, Anthos Service Mesh, Portworx, multi-cluster/federation
CI/CD with GitLab CI/CD, Jenkins; zero-downtime deployments and DR strategies
Observability using Prometheus, Datadog, Splunk, Kiali, eBPF for deep system visibility
Real-time streaming: Kafka, KSQLDB, Kafka Streams, Spark Streaming
Security & compliance: IAM, secrets management, SOC2, PCI-DSS, SOX, banking-grade controls
Strong Linux/Unix, Docker, VMware, networking tools (Nginx Controller, Seesaw)
Experience with high-frequency transaction systems and regulated environments
AWS Solutions Architect – Professional or AWS DevOps Engineer – Professional
CKA or CKS

Company

Programming.com

twittertwittertwitter
company-logo
Programming.com is a leading software development company, providing expertise in strategy, consulting, technology and IT operations.

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Shashank Munim
Managing Partner
linkedin
Company data provided by crunchbase