Sightview Software · 3 hours ago
Site Reliability Engineer
Sightview Software LLC is a healthcare technology company providing intelligent EHR and practice management solutions built exclusively for eye care. The role of the Site Reliability Engineer focuses on supporting the monitoring, observability, and incident response functions of the AWS-based infrastructure while ensuring system health and improving alerting quality.
Responsibilities
Maintain and enhance monitoring of dashboards, alerts, and metrics using Datadog, AWS CloudWatch, Prometheus, and Grafana
Monitor system health and application performance across AWS environments
Tune alerts to reduce noise and ensure timely detection of issues
Assist in defining service-level indicators (SLIs) and alert thresholds
Ensure logs, metrics, and traces are available and actionable
Participate in on-call rotations and respond to production incidents
Perform initial incident triage, troubleshooting, and escalation as needed
Contribute to root cause analysis (RCA) and post-incident reviews
Follow and help improve operational runbooks and response procedures
Collaborate with senior SREs and engineering teams during incident resolution
Support AWS infrastructure with a focus on observability and reliability
Assist with Terraform-managed infrastructure changes related to monitoring and alerting
Assist with performance, load, and stress testing of web applications to identify bottlenecks and durability issues
Develop scripts and automation (Python, Bash, Powershell) to improve operational efficiency
Work with application teams to ensure services are properly instrumented
Participate in a rotating on-call schedule shared across the SRE/Operations team
On-call coverage includes monitoring alerts, responding to incidents, and coordinating resolutions
Incidents outside business hours are expected to be addressed according to defined SLAs
The team emphasizes alert quality, runbooks, and shared responsibility to minimize burnout
Qualification
Required
3–6 years of experience in SRE, DevOps, or production operations roles
Hands-on experience with AWS (EC2, ECS, ELB/ALB, Route53, IAM, CloudWatch)
Experience using Datadog for monitoring, dashboards, and alerting
Knowledge of Windows and Linux systems administration
Experience with scripting and automation (Python, Bash, Powershell)
Experience supporting production environments with on-call responsibilities
Preferred
Experience with Terraform or other Infrastructure-as-Code tools
Exposure to HIPAA-compliant or regulated systems
Familiarity with PHP in a support or troubleshooting context
Experience with Prometheus and Grafana
Familiarity with CI/CD tools (Bitbucket)
Familiarity with microservices architectures
Company
Sightview Software
The only technology partner solely focused on eyecare, Sightview offers ophthalmologists, optometrists, and opticians a single, end-to-end solution that addresses the needs and challenges of today’s eyecare providers.
Funding
Current Stage
Growth StageRecent News
2025-10-29
2024-04-10
2024-04-10
Company data provided by crunchbase