Site Reliability Engineer - Fulltime Only jobs in United States
cer-icon
Apply on Employer Site
company-logo

VBeyond Corporation · 21 hours ago

Site Reliability Engineer - Fulltime Only

VBeyond Corporation is seeking a Site Reliability Engineer focused on observability, Kubernetes, and cloud infrastructure. The role involves ownership of the observability stack, building reliable monitoring pipelines, and improving cluster reliability through automation and performance tuning.

ConsultingCRMDeliveryHuman ResourcesInformation Technology
Hiring Manager
Ekta Singh
linkedin

Responsibilities

SRE role focused on observability, Kubernetes, and cloud infrastructure (AWS/GCP/EKS)
Ownership of observability stack: Prometheus, Grafana, OpenTelemetry, ELK/Loki/Splunk, Jaeger, Alertmanager, SLOs
Build and maintain reliable monitoring pipelines for metrics, logs, tracing, dashboards, and alerts
Develop Terraform modules for observability infrastructure, Kubernetes components, and cluster add-ons
Improve cluster reliability through automation, performance tuning, capacity planning, and remediation
Implement AI-assisted diagnostics for anomaly detection, alert tuning, and noise reduction
Collaborate with Platform Engineering on Istio/service mesh telemetry and platform health
Lead SLO reporting, incident management, and root cause analysis

Qualification

KubernetesTerraformObservability toolsAWSGCPAutomation (Python/Go)CI/CDCloud networkingIncident managementRoot cause analysis

Required

SRE role focused on observability, Kubernetes, and cloud infrastructure (AWS/GCP/EKS)
Ownership of observability stack: Prometheus, Grafana, OpenTelemetry, ELK/Loki/Splunk, Jaeger, Alertmanager, SLOs
Build and maintain reliable monitoring pipelines for metrics, logs, tracing, dashboards, and alerts
Develop Terraform modules for observability infrastructure, Kubernetes components, and cluster add-ons
Improve cluster reliability through automation, performance tuning, capacity planning, and remediation
Implement AI-assisted diagnostics for anomaly detection, alert tuning, and noise reduction
Collaborate with Platform Engineering on Istio/service mesh telemetry and platform health
Lead SLO reporting, incident management, and root cause analysis
4–8 years of experience in SRE, infrastructure, or Kubernetes operations
Strong expertise in observability tools, Terraform, automation (Python/Go), CI/CD, and cloud networking

Company

VBeyond Corporation

twittertwittertwitter
company-logo
VBeyond Corporation is a staffing and recruiting company specializing in emerging search and HR consulting services.

Funding

Current Stage
Growth Stage

Leadership Team

leader-logo
Rajesh Khanna
President
linkedin
leader-logo
Sandeep Mitra
Director
linkedin
Company data provided by crunchbase