Site Reliability Engineer jobs in United States

Sightview Software · 3 hours ago

Site Reliability Engineer

Sightview Software LLC is a healthcare technology company providing intelligent EHR and practice management solutions built exclusively for eye care. The Site Reliability Engineer supports monitoring, observability, and incident response for the company's AWS-based infrastructure, with a focus on maintaining system health and improving alert quality.
Electronic Health Record (EHR), Eyewear, Medical, Software
Hiring Manager
Manjot Kaur

Responsibilities

Maintain and enhance monitoring dashboards, alerts, and metrics using Datadog, AWS CloudWatch, Prometheus, and Grafana
Monitor system health and application performance across AWS environments
Tune alerts to reduce noise and ensure timely detection of issues
Assist in defining service-level indicators (SLIs) and alert thresholds
Ensure logs, metrics, and traces are available and actionable
Participate in on-call rotations and respond to production incidents
Perform initial incident triage, troubleshooting, and escalation as needed
Contribute to root cause analysis (RCA) and post-incident reviews
Follow and help improve operational runbooks and response procedures
Collaborate with senior SREs and engineering teams during incident resolution
Support AWS infrastructure with a focus on observability and reliability
Assist with Terraform-managed infrastructure changes related to monitoring and alerting
Assist with performance, load, and stress testing of web applications to identify bottlenecks and durability issues
Develop scripts and automation (Python, Bash, PowerShell) to improve operational efficiency
Work with application teams to ensure services are properly instrumented
Participate in a rotating on-call schedule shared across the SRE/Operations team
On-call coverage includes monitoring alerts, responding to incidents, and coordinating resolutions
Incidents outside business hours are expected to be addressed according to defined SLAs
The team emphasizes alert quality, runbooks, and shared responsibility to minimize burnout

Qualifications

AWS, Datadog, Scripting, Linux Administration, Terraform, Prometheus, Grafana, On-call Experience, CI/CD Tools, Microservices, PHP, HIPAA Compliance

Required

3–6 years of experience in SRE, DevOps, or production operations roles
Hands-on experience with AWS (EC2, ECS, ELB/ALB, Route53, IAM, CloudWatch)
Experience using Datadog for monitoring, dashboards, and alerting
Knowledge of Windows and Linux systems administration
Experience with scripting and automation (Python, Bash, PowerShell)
Experience supporting production environments with on-call responsibilities

Preferred

Experience with Terraform or other Infrastructure-as-Code tools
Exposure to HIPAA-compliant or regulated systems
Familiarity with PHP in a support or troubleshooting context
Experience with Prometheus and Grafana
Familiarity with CI/CD tools (Bitbucket)
Familiarity with microservices architectures

Company

Sightview Software

The only technology partner solely focused on eyecare, Sightview offers ophthalmologists, optometrists, and opticians a single, end-to-end solution that addresses the needs and challenges of today’s eyecare providers.

Funding

Current Stage
Growth Stage

Leadership Team

Tycene Fritcher
Chief Executive Officer
Jeff Macomber
Chief Technology Officer
Company data provided by Crunchbase