VP, Site Reliability Engineer jobs in United States
info-icon
This job has closed.
company-logo

Galaxy · 2 months ago

VP, Site Reliability Engineer

Galaxy Digital Services is a global leader in digital assets and data center infrastructure, focused on delivering innovative solutions in finance and artificial intelligence. The VP, Site Reliability Engineer will architect and maintain secure AWS-based infrastructure, drive the adoption of containerized workloads, and enhance observability and incident response processes.

Asset ManagementBankingCryptocurrencyFinancial ServicesFinTech
check
Comp. & Benefits
check
H1B Sponsor Likelynote

Responsibilities

Architect, deploy, and maintain robust, scalable, secure AWS-based infrastructure
Drive adoption and optimization of EKS and Kubernetes for containerized workloads
Support migration initiatives, moving workloads from legacy VMs to containers in AWS
Implement and fine-tune SLOs, SLAs, and error budgets to balance innovation and stability
Collaborate on best practices with Security and Engineering teams for workload reliability
Build Infrastructure as Code (IaC) with Terraform; maintain compliant, repeatable environments
Enhance CI/CD pipelines for efficient, secure, and reliable cloud delivery
Develop and refine automated solutions for autoscaling, failover, and disaster recovery
Design and implement metrics, logging, and tracing tools (Datadog, OpenTelemetry)
Set up robust monitoring and alerting to proactively detect and address failures
Lead incident analysis and post-mortems; drive improvements in operational playbooks
Serve as a subject matter expert for AWS, EKS, and cloud-native tooling within the SRE team
Optimize AWS resources, cost management, and resiliency best practices
Ensure secure key management and regulatory compliance for decentralized workloads

Qualification

AWSKubernetes/EKSTerraformContainerizationObservability stacksIncident managementAnalytical skillsClear communication

Required

8+ years in SRE, DevOps, or Infrastructure Engineering (IC capacity preferred)
Deep hands-on expertise in AWS, Kubernetes/EKS, and containerization
Extensive IaC experience (Terraform) and cloud-native automation
Proven track record migrating VM-based workloads to containers in AWS at scale
Strong experience with observability stacks (Datadog, Prometheus, Grafana, OpenTelemetry)
Excellent analytical, problem-solving, and incident management abilities
Clear communicator who thrives in team environments, collaborating cross-functionally

Preferred

Experience supporting blockchain infrastructure is a strong plus

Company

Galaxy

twittertwittertwitter
company-logo
Galaxy is a global leader in digital assets and data center infrastructure, delivering solutions that accelerate progress in finance and artificial intelligence.

H1B Sponsorship

Galaxy has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (21)
2024 (5)
2023 (7)
2022 (14)
2021 (6)

Funding

Current Stage
Public Company
Total Funding
$4.71B
Key Investors
Michael NovogratzHCM Capital
2025-12-11Post Ipo Debt· $50M
2025-10-27Post Ipo Debt· $1.15B
2025-10-10Post Ipo Equity· $460M

Leadership Team

leader-logo
Sam Englebardt
Co-Founder and Managing Director
linkedin
leader-logo
Luka Jankovic
Head of Lending, Managing Director
linkedin
Company data provided by crunchbase