Valigator · 4 hours ago
Junior Operations Engineer (Reliability)
Valigator operates always-on blockchain infrastructure supporting the Solana ecosystem and is seeking a Junior Operations Engineer (Reliability) to join its lean team. The role involves shadowing operational workflows, growing into on-call coverage, improving runbooks, and reducing operational toil through automation.
Information Technology & Services
Responsibilities
Grow into on-call coverage over time (shadow first, then take on defined incident types with backup and clear escalation paths); once ramped, expect to be on call approximately one week in every three
Triage production alerts with calm judgment, clear communication, and good escalation instincts (ask early when uncertain)
Use runbooks to drive safe recovery, then improve runbooks based on what actually happened
Perform routine maintenance and upgrades in a controlled, repeatable way, and validate system health after changes
Improve monitoring and alert quality so pages are actionable, not noisy
Reduce repeated manual work by turning it into scripts, guardrails, or lightweight automation
Keep clean operational records through tickets and PRs, documenting intent, risk, and outcomes
Contribute to post-incident reviews focused on learning and reducing repeat pages (not blame)
Qualifications
Required
Comfort working in Linux environments and the command line
Debugging fundamentals: logs, metrics, basic networking intuition, and systems reasoning
Calm incident response habits and solid escalation judgment
Clear written communication, especially during incidents and handoffs
Willingness to join on-call gradually and help improve it over time
Preferred
Scripting (Bash) and/or programming experience (Python especially; Rust is a plus)
Familiarity with observability tools (Datadog, Prometheus/Grafana, or similar)
Exposure to automation/IaC (Ansible, Terraform) and PR-based workflows
Comfort with hardware-adjacent troubleshooting (disk health, IO bottlenecks, capacity basics)
Familiarity with secure operations practices (least privilege, key hygiene, audit trails, change control)
Interest in crypto/Web3 (not required)
Company
Valigator
Funding
Current Stage
Early Stage
Company data provided by Crunchbase