Turion Space · 9 hours ago
Lead Site Reliability Engineer
Turion Space is looking for a Lead Site Reliability Engineer to build the monitoring and reliability systems that keep satellites connected to Earth. The role involves building observability infrastructure, and potentially leading teams, so that space communications systems operate 24/7 for customers ranging from commercial satellite operators to national security missions.
Aerospace, Information Technology, Manufacturing, Space Travel
Responsibilities
Build monitoring and reliability systems that keep satellites connected to Earth
Build the observability infrastructure that ensures our space communications systems operate 24/7 for customers ranging from commercial satellite operators to national security missions
Evolve from building core monitoring systems to potentially leading teams and architecting global-scale reliability platforms
Work directly with our platform engineering team to establish the monitoring, alerting, and deployment practices that will scale with us from startup to enterprise
Qualifications
Required
5+ years of relevant hands-on experience in production operations and 1-2+ years in a technical leadership role
Ability to work across multiple engineering disciplines and with diverse teams, communicating clearly and operating with minimal oversight
Experience with observability tools (Grafana, Prometheus, Loki, Alloy, ELK) in production environments
Hands-on experience with DR planning, failure mode analysis, and building resilient systems with automated failover and recovery
Familiarity with HashiCorp Vault, Okta, or similar identity/secrets management systems
Previous experience scaling infrastructure at high-growth companies (startup to 100+ employees)
Linux system administration experience and networking fundamentals
Strong experience with Kubernetes, Docker, and container orchestration in production environments
Hands-on experience with CI/CD tools and infrastructure as code (Terraform or Crossplane preferred)
AWS experience with multi-service deployments and programming skills for automation (Bash, Python)
Self-directed work style with ability to own projects from conception to production in fast-moving environments
Understanding of SRE principles, SLOs/SLIs, and systematic approaches to system reliability
Preferred
Demonstrated success in executing large projects on tight timelines
AWS certification or demonstrated expertise with advanced cloud networking and security
Interest in aerospace, telecommunications, or mission-critical systems
Active Secret or TS/SCI clearance that can be maintained
Benefits
Equity: Receive equity in Turion Space, letting you benefit from the company's success.
Health Insurance: Comprehensive medical, dental, and vision coverage for employees and their dependents.
Retirement Plans: Access to a 401(k) plan to help you plan for your future.
Paid Time Off: Generous vacation days, personal days, sick days, and holidays to ensure you have time to recharge.
Professional Development: Opportunities for ongoing training, workshops, and courses to advance your skills and career growth.
Team Building Activities: Regular social events, team outings, and company-sponsored activities to foster a positive work environment.
Company
Turion Space
Turion Space designs, builds, and operates DROID satellites to address pressing national security needs.
Funding
Current Stage: Growth Stage
Total Funding: $27.82M
Key Investors: Veteran Ventures Capital
2024-12-02 · Series A · $20M
2022-03-01 · Seed · $1.5M
2021-09-01 · Seed · $6.2M
Company data provided by Crunchbase