VP, Site Reliability Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Galaxy · 9 hours ago

VP, Site Reliability Engineer

Galaxy is a global leader in digital assets and data center infrastructure, delivering solutions that accelerate progress in finance and artificial intelligence. They are seeking a Senior Site Reliability Engineer specializing in AWS and containerized infrastructure to architect, deploy, and maintain robust AWS-based infrastructure while driving migration initiatives and ensuring workload reliability.

Asset ManagementBankingCryptocurrencyFinancial ServicesFinTech
check
Comp. & Benefits
check
H1B Sponsor Likelynote

Responsibilities

Architect, deploy, and maintain robust, scalable, secure AWS-based infrastructure
Drive adoption and optimization of EKS and Kubernetes for containerized workloads
Support migration initiatives, moving workloads from legacy VMs to containers in AWS
Implement and fine-tune SLOs, SLAs, and error budgets to balance innovation and stability
Collaborate on best practices with Security and Engineering teams for workload reliability
Build Infrastructure as Code (IaC) with Terraform; maintain compliant, repeatable environments
Enhance CI/CD pipelines for efficient, secure, and reliable cloud delivery
Develop and refine automated solutions for autoscaling, failover, and disaster recovery
Design and implement metrics, logging, and tracing tools (Datadog, OpenTelemetry)
Set up robust monitoring and alerting to proactively detect and address failures
Lead incident analysis and post-mortems; drive improvements in operational playbooks
Serve as a subject matter expert for AWS, EKS, and cloud-native tooling within the SRE team
Optimize AWS resources, cost management, and resiliency best practices
Ensure secure key management and regulatory compliance for decentralized workloads

Qualification

AWSKubernetes/EKSInfrastructure as CodeObservability stacksTerraformCloud-native automationIncident managementAnalytical skillsProblem-solvingClear communication

Required

8+ years in SRE, DevOps, or Infrastructure Engineering (IC capacity preferred)
Deep hands-on expertise in AWS, Kubernetes/EKS, and containerization
Extensive IaC experience (Terraform) and cloud-native automation
Proven track record migrating VM-based workloads to containers in AWS at scale
Strong experience with observability stacks (Datadog, Prometheus, Grafana, OpenTelemetry)
Excellent analytical, problem-solving, and incident management abilities
Clear communicator who thrives in team environments, collaborating cross-functionally

Preferred

Experience supporting blockchain infrastructure is a strong plus

Company

Galaxy

twittertwittertwitter
company-logo
Galaxy is a global leader in digital assets and data center infrastructure, delivering solutions that accelerate progress in finance and artificial intelligence.

H1B Sponsorship

Galaxy has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (21)
2024 (5)
2023 (7)
2022 (14)
2021 (6)

Funding

Current Stage
Public Company
Total Funding
$4.71B
Key Investors
Michael NovogratzHCM Capital
2025-12-11Post Ipo Debt· $50M
2025-10-27Post Ipo Debt· $1.15B
2025-10-10Post Ipo Equity· $460M

Leadership Team

leader-logo
Sam Englebardt
Co-Founder and Managing Director
linkedin
leader-logo
Luka Jankovic
Portfolio Manager
linkedin
Company data provided by crunchbase