SIGN IN
Site Reliability Engineer (Space Communications) jobs in United States
cer-icon
Apply on Employer Site
company-logo

Northwood · 5 months ago

Site Reliability Engineer (Space Communications)

NorthwoodSpace is on a mission to transform connectivity between earth and space through innovations in space communications technologies. They are seeking an Infrastructure Engineer to build and maintain observability infrastructure and ensure the reliability of their global space communications network as they scale operations and establish ground stations worldwide.
AerospaceHardwareSatellite Communication
check
Diversity & Inclusion
badNo H1BnoteU.S. Citizen Onlynote

Responsibilities

Build and maintain observability stack with tools like Grafana, Prometheus, Loki, Vector, CloudWatch, VictoriaMetrics, etc. for metrics and log ingestion across environments
Support and improve CI/CD pipelines using GitLab and ArgoCD, collaborating with development teams on deployment best practices
Help build and maintain cloud infrastructure using Terraform on AWS, contributing to the scalability and reliability of our space communication systems
Work with senior engineers to establish monitoring strategies, alerting, and incident response procedures
Deploy and manage Kubernetes applications using Helm charts, with focus on reliability and developer experience
Collaborate with engineering teams to implement performance monitoring and troubleshooting across microservices
Support identity and access management integration with Okta and HashiCorp Vault
Assist in managing NixOS-based infrastructure for reproducible system configurations
Participate in incident response efforts and contribute to post-incident reviews and improvements

Qualification

Infrastructure toolsMonitoring systemsCI/CD pipelinesCloud infrastructureContainerizationInfrastructure as codePython programmingLoggingMetricsStartup mentalitySystem reliability principlesDistributed systemsLinux administrationAWS certification

Required

2-4 years of hands-on experience with infrastructure tools and monitoring systems in production environments
Experience with containerization (Docker, Kubernetes) and basic container orchestration
Familiarity with CI/CD tools (GitLab, Jenkins, or similar) and infrastructure as code concepts
Experience with cloud platforms (AWS preferred) and basic infrastructure automation
Programming skills in Python or similar language and experience with configuration management
Startup mentality with ability to work in fast-paced, high-growth environments and take on diverse responsibilities
Experience with logging and metrics collection for production systems
Understanding of system reliability principles and interest in learning SRE practices

Preferred

Some exposure to observability tools like Vector, Loki, Grafana, Prometheus, or similar monitoring systems
Experience with Terraform or other infrastructure as code tools
Familiarity with NixOS or other declarative system configuration approaches
Basic knowledge of HashiCorp Vault, Okta, or similar identity/secrets management tools
Interest in distributed systems and troubleshooting complex technical issues
Previous startup experience or demonstrated ability to learn quickly and adapt
Linux system administration experience
AWS certification or demonstrated cloud platform knowledge

Company

Northwood

twittertwittertwitter
company-logo
Northwood was founded by Bridgit Mendler, Griffin Cleverly, and Shaurya Luthra with the mission to expand access to space by transforming satellite backhaul infrastructure.

Funding

Current Stage
Growth Stage
Total Funding
$136.4M
Key Investors
Andreessen Horowitz,Washington Harbour PartnersAlpine Space Ventures,Andreessen HorowitzAndreessen Horowitz,Founders Fund
2026-01-27Series B· $100M
2025-04-22Series A· $30M
2024-02-19Seed· $6.3M

Leadership Team

leader-logo
Bridgit Mendler
CEO – Cofounder
linkedin
G
Griffin Cleverly
Co-Founder, CTO
linkedin
Company data provided by crunchbase