Jobs via Dice · 20 hours ago
Site Reliability Engineer
Apex Systems is a world-class IT services company seeking a highly skilled Site Reliability Engineer. This role focuses on improving system reliability and operational intelligence using a modern tech stack, while collaborating closely with Production Support teams to enhance system performance.
Computer Software
Responsibilities
Design and implement highly available, scalable, and performant systems
Collaborate with Production Support REs to analyze operational data and identify systemic issues
Drive continuous improvement through blameless postmortems and actionable insights
Apply Generative AI to accelerate incident triage, root cause analysis, and predictive reliability
Implement AI-driven automation for operational workflows while ensuring safety, security, and compliance
Build dashboards for reliability metrics, insights, and incident trends using React
Develop, support, and maintain automation scripts, APIs, and GenAI integration pipelines using Python
Design and maintain schemas for incident data, runbooks, and knowledge graphs with performance optimization using MongoDB
Implement advanced search and observability solutions for logs and RCA repositories using Elasticsearch
Capture learnings from production interactions and share insights through dashboards and reports across engineering and support teams
Qualifications
Required
Proven experience in Site Reliability Engineering or related roles
Strong background in Production Support for large-scale systems
Familiarity with reliability enablement frameworks
React for front-end dashboards
Strong Python development for automation and backend services
MongoDB NoSQL data modeling and performance tuning
Elasticsearch for search and observability use cases
Demonstrated interest or hands-on experience with Generative AI and integrating models into operational workflows
Strong problem-solving, analytical, communication, and collaboration skills
Preferred
Experience with agentic AI, LangChain, and LangGraph
Benefits
Medical
Dental
Vision
Life
Disability
Other insurance plans
ESPP (employee stock purchase program)
401(k) program
HSA (Health Savings Account on the HDHP plan)
SupportLinc Employee Assistance Program (EAP) with up to 8 free counseling sessions
Corporate discount savings program
Certification prep
Library of technical and leadership courses/books/seminars
Certification discounts
Career Coach
Company
Jobs via Dice
Welcome to Jobs via Dice, the go-to destination for discovering the tech jobs you want.
Funding
Current Stage
Early Stage
Company data provided by Crunchbase