Insight Global · 11 hours ago
REMOTE Site Reliability Engineer
Insight Global is seeking a REMOTE Site Reliability Engineer with a strong software engineering background. The role focuses on enhancing the reliability of production systems, driving reliability outcomes, and automating processes within cross-functional teams.
Responsibilities
Embed with product and platform teams to own reliability for key services; come in and “run with” active projects
Define and drive SLOs/SLAs/SLIs; implement actionable alerting and dashboards (primary: Datadog)
Automate reliability work (deployment, scaling, failover, incident workflows) using code-first approaches
Author infrastructure as code (primarily Terraform) and collaborate on Docker/Kubernetes workflows
Instrument services (.NET primary stack; Python/Rust for tooling; Java is a plus) for observability and performance
Own incidents end-to-end: triage, root cause, postmortems, and preventative engineering
Apply systems thinking to reduce complexity, improve resilience, and increase change velocity safely
Partner with security and cloud teams on guardrails, least-privilege, and cross-cloud considerations
Write stories and technical docs that clarify problems, solutions, and acceptance criteria
Continuously improve reliability patterns, runbooks, and automation pipelines
Qualification
Required
Proven SRE experience (3+ years minimum at mid-staff level) owning reliability for production systems
Software engineering background with strong procedural thinking; you've shipped production code
Proficient in scripting languages such as Python, Bash, or similar
.NET expertise as the primary skillset (services, APIs, performance, instrumentation)
Datadog hands-on experience (dashboards, monitors, logs, APM, alerting)
AWS foundational knowledge (you don't need a pro cert; you can reason about core services and IAM)
Infrastructure as Code with Terraform (modules, state, environments)
Practical knowledge of Docker and Kubernetes (how it works, how to debug and operate)
Familiarity with SQL/Postgres (querying, performance basics)
Preferred
Continued education and/or advanced degree(s) in Computer Science, Information Technology, or a related field
AWS certifications (such as AWS Certified Solutions Architect, AWS Certified Database - Specialty, or AWS Certified Security - Specialty)
Ability to understand and refactor complex legacy software
Experience in environments subject to HIPAA and/or PCI regulations
Professional experience with project lifecycle planning such as Agile/Scrum
Comfortable with Atlassian software suite (Jira, Confluence, and OpsGenie)
Experience with Rust
AWS Glue
AWS Neptune or other AWS purpose-built databases
Company
Insight Global
Insight Global provides top talent and staffing solutions that help job seekers find careers in healthcare, finance, IT, and government.
H1B Sponsorship
Insight Global has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (281)
2024 (164)
2023 (75)
2022 (17)
2021 (3)
2020 (2)
Funding
Current Stage
Late StageTotal Funding
unknown2010-07-01Acquired
Recent News
Maryland Daily Record
2025-09-26
Company data provided by crunchbase