Junior Site Reliability Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Lightspeed Systems · 5 days ago

Junior Site Reliability Engineer

Lightspeed Systems is a global leader in education technology, providing AI-powered solutions that keep students safe, engaged, and learning. The Junior Site Reliability Engineer will join a team focused on ensuring the reliability and scalability of the company's infrastructure, automating operations, and applying AI to real-world SRE challenges.

EdTechEducationInformation TechnologyMobile DevicesNetwork SecuritySoftware
badNo H1Bnote

Responsibilities

Develop a deep understanding of one or more infrastructure services and their role in our platform
Use Terraform to design, deploy, and maintain infrastructure as code (IaC)
Automate workflows and deployments with GitHub Actions, leveraging GitHub Copilot and other AI-assisted tools to improve reliability and speed
Explore agentic automation (e.g., incident triage, self-healing scripts, automated runbook generation) to advance our SRE capabilities
Participate in Level 1 on-call support: monitor, respond, escalate, and improve system stability
Assist with performance, load, and stress testing of web applications to identify bottlenecks and durability issues
Implement, maintain, and enhance observability and monitoring (e.g., via Datadog or similar)
Track work in Jira, communicate clearly about progress and blockers, and collaborate across teams (Platform, Product, QA, etc.)
Participate in incident response and post-mortems, contributing to reliability improvements

Qualification

TerraformLinux administrationContainersOrchestrationAWS servicesBasic programmingCI/CD workflowsMonitoringObservabilityCuriosityWillingness to learnProblem-solvingCommunication skills

Required

1–2 years in a DevOps, SRE, or infrastructure role (or equivalent experience)
Strong problem-solving and troubleshooting mindset
Excellent communication skills, curiosity, and willingness to learn in a distributed team environment
Interest in applying AI-assisted development tools and automation to SRE workflows
Basic programming experience (Go, Python, or JavaScript/Node.js)
Solid Linux administration fundamentals
Experience with containers and orchestration (e.g., Docker, AWS Fargate, ECS)
Hands-on exposure to Terraform and IaC concepts
Familiarity with AWS services and cloud infrastructure fundamentals
Exposure to CI/CD workflows (GitHub, GitHub Actions, etc.)
Awareness of observability, monitoring, and logging practices

Preferred

Experience or interest in agentic AI frameworks or LLM-based automation for scripting, diagnostics, or incident management
Hands-on familiarity with performance/stress-testing tools (k6, Locust, JMeter) and monitoring web applications under load
Familiarity with cloud networking, security best practices, and key AWS services (API Gateway, Lambda, ECS, DynamoDB, OpenSearch, Redis, PostgreSQL)
AWS SysOps Administrator – Associate certification (or equivalent experience)

Benefits

Medical, dental and vision insurance with healthy company contribution toward premiums.
Lightspeed kicks cash into your HSA if you participate in our HDHP.
Paid parental leave.
Healthy holiday and PTO policy, including Christmas to New Year’s Day break.
401(k) matching up to 6%
Work from where it makes sense.
Pet insurance.

Company

Lightspeed Systems

twittertwittertwitter
company-logo
Lightspeed Systems, Inc., provides web filtering, mobile device management, and device monitoring solutions for K-12 education.

Funding

Current Stage
Growth Stage
Total Funding
unknown
Key Investors
Genstar CapitalMadison Dearborn Partners
2022-03-02Private Equity
2019-03-29Private Equity

Leadership Team

leader-logo
Brian Thomas
Chief Executive Officer
linkedin
leader-logo
Kirk Orgeldinger
President & CFO
linkedin
Company data provided by crunchbase