Senior Software Engineer, Observability jobs in United States
cer-icon
Apply on Employer Site
company-logo

PlayOn Sports · 1 month ago

Senior Software Engineer, Observability

PlayOn is a dynamic growth-stage company dedicated to championing the spirit of play in the high school space. They are seeking an experienced Senior Software Engineer to enhance the reliability, performance, and scalability of their systems, focusing on building tools and automation that enable resilient software delivery.

BroadcastingContentDigital MediaEvent ManagementEventsInternetNewsTicketingVideo on DemandVideo Streaming

Responsibilities

Assess and improve visibility: Work with engineering teams to review our current dashboards, metrics, and logs, identify the biggest gaps, and make targeted improvements that help us better understand system health
Tighten monitoring and alerting: Refine alerts and dashboards for the most critical services so we can catch issues earlier and respond faster
Build observability into delivery: Add instrumentation and telemetry into existing build and deploy processes to make reliability checks part of our normal release workflow
Clarify what "reliable" means: Help define initial SLIs and SLOs for a few core user flows, aligning the team on what good performance and availability look like
Streamline incident response: Partner with the Event Commander/on-call rotation to improve how we communicate, coordinate, and follow up during incidents
Reduce manual effort: Automate routine checks and monitoring tasks to free up engineers for more impactful work
Contribute to system observability i.e implementing, improving metrics, alerting, and dashboards for better insight and faster recovery
Develop automation, tooling, and monitoring solutions to support high service availability
Partner with application and quality engineering teams to implement best practices in reliability, release automation, and testing
Drive operational excellence through proactive incident prevention, blameless postmortems, and capacity planning
Participate in on-call rotations to support critical services and ensure rapid response to incidents

Qualification

PythonCloud infrastructureCI/CD pipelinesObservability toolsJavaLinux systemsAutomated testing frameworksCollaborationProblem-solving

Required

Solid experience in Python, especially for automation, tooling, and data-driven operational tasks
Proficiency in at least one (Java, C++, or Go)
Strong understanding of Linux systems, cloud infrastructure (AWS, GCP, or Azure), and modern deployment practices (Docker, Kubernetes, Terraform)
Experience with CI/CD pipelines, version control, and automated testing frameworks
Experience with observability tools (e.g., Prometheus, Grafana, ELK, Datadog, etc.) and log/metric analysis for diagnosing issues
Proven experience facilitating and documenting Critical User Journeys translating them to actionable SLA/SLO for automation
Demonstrated ability to collaborate with cross-functional teams and communicate clearly in high-impact situations
A problem-solver who approaches reliability as a shared responsibility across engineering

Preferred

Experience writing or maintaining end-to-end or integration tests for distributed systems
Background in performance testing, capacity planning, or chaos engineering
Contributions to internal developer tooling or reliability-focused frameworks
Exposure to security, compliance, or change management processes in production environments
Relevant certifications

Benefits

Multiple medical insurance plans to choose from
Dental, vision life and disability insurance
Employee Emergency Fund
Company equity (stock options)
Open PTO policy
401K plan with company match
Hybrid/flexible work environment

Company

PlayOn Sports

twittertwitter
company-logo
PlayOn is the all-in-one fan engagement platform for schools.

Funding

Current Stage
Late Stage
Total Funding
$72.31M
Key Investors
BIP CapitalHerff JonesHamilton Ventures LLC
2022-04-26Acquired
2022-02-01Private Equity
2021-01-08Series Unknown· $10M

Leadership Team

leader-logo
Perkins Miller
Chief Executive Officer
linkedin
leader-logo
David Rudolph
President, Streaming and Coaching Tools
linkedin
Company data provided by crunchbase