Northwood · 5 months ago
Site Reliability Engineer (Space Communications)
NorthwoodSpace is on a mission to transform connectivity between earth and space through innovations in space communications technologies. They are seeking an Infrastructure Engineer to build and maintain observability infrastructure and ensure the reliability of their global space communications network as they scale operations and establish ground stations worldwide.
AerospaceHardwareSatellite Communication
Responsibilities
Build and maintain observability stack with tools like Grafana, Prometheus, Loki, Vector, CloudWatch, VictoriaMetrics, etc. for metrics and log ingestion across environments
Support and improve CI/CD pipelines using GitLab and ArgoCD, collaborating with development teams on deployment best practices
Help build and maintain cloud infrastructure using Terraform on AWS, contributing to the scalability and reliability of our space communication systems
Work with senior engineers to establish monitoring strategies, alerting, and incident response procedures
Deploy and manage Kubernetes applications using Helm charts, with focus on reliability and developer experience
Collaborate with engineering teams to implement performance monitoring and troubleshooting across microservices
Support identity and access management integration with Okta and HashiCorp Vault
Assist in managing NixOS-based infrastructure for reproducible system configurations
Participate in incident response efforts and contribute to post-incident reviews and improvements
Qualification
Required
2-4 years of hands-on experience with infrastructure tools and monitoring systems in production environments
Experience with containerization (Docker, Kubernetes) and basic container orchestration
Familiarity with CI/CD tools (GitLab, Jenkins, or similar) and infrastructure as code concepts
Experience with cloud platforms (AWS preferred) and basic infrastructure automation
Programming skills in Python or similar language and experience with configuration management
Startup mentality with ability to work in fast-paced, high-growth environments and take on diverse responsibilities
Experience with logging and metrics collection for production systems
Understanding of system reliability principles and interest in learning SRE practices
Preferred
Some exposure to observability tools like Vector, Loki, Grafana, Prometheus, or similar monitoring systems
Experience with Terraform or other infrastructure as code tools
Familiarity with NixOS or other declarative system configuration approaches
Basic knowledge of HashiCorp Vault, Okta, or similar identity/secrets management tools
Interest in distributed systems and troubleshooting complex technical issues
Previous startup experience or demonstrated ability to learn quickly and adapt
Linux system administration experience
AWS certification or demonstrated cloud platform knowledge
Company
Northwood
Northwood was founded by Bridgit Mendler, Griffin Cleverly, and Shaurya Luthra with the mission to expand access to space by transforming satellite backhaul infrastructure.
Funding
Current Stage
Growth StageTotal Funding
$136.4MKey Investors
Andreessen Horowitz,Washington Harbour PartnersAlpine Space Ventures,Andreessen HorowitzAndreessen Horowitz,Founders Fund
2026-01-27Series B· $100M
2025-04-22Series A· $30M
2024-02-19Seed· $6.3M
Recent News
2026-02-06
Satellite Today
2025-07-11
Company data provided by crunchbase