Site Reliability Engineer jobs in United States
info-icon
This job has closed.
company-logo

Jobs via Dice ยท 20 hours ago

Site Reliability Engineer

Apex Systems is a world-class IT services company seeking a highly skilled Site Reliability Engineer. This role focuses on improving system reliability and operational intelligence using a modern tech stack, while collaborating closely with Production Support teams to enhance system performance.

Computer Software

Responsibilities

Design and implement highly available, scalable, and performant systems
Collaborate with Production Support REs to analyze operational data and identify systemic issues
Drive continuous improvement through blameless postmortems and actionable insights
Apply Generative AI to accelerate incident triage, root cause analysis, and predictive reliability
Implement AI-driven automation for operational workflows while ensuring safety, security, and compliance
Build dashboards for reliability metrics, insights, and incident trends using React
Develop, support, and maintain automation scripts, APIs, and GenAI integration pipelines using Python
Design and maintain schemas for incident data, runbooks, and knowledge graphs with performance optimization using MongoDB
Implement advanced search and observability solutions for logs and RCA repositories using Elasticsearch
Capture learnings from production interactions and share insights through dashboards and reports across engineering and support teams

Qualification

Site Reliability EngineeringProduction SupportGenerative AIPythonMongoDBReactElasticsearchAnalytical skillsProblem-solvingCommunicationCollaboration

Required

Proven experience in Site Reliability Engineering or related roles
Strong background in Production Support for large-scale systems
Familiarity with reliability enablement frameworks
React for front-end dashboards
Strong Python development for automation and backend services
MongoDB NoSQL data modeling and performance tuning
Elasticsearch for search and observability use cases
Demonstrated interest or hands-on experience with Generative AI and integrating models into operational workflows
Strong problem-solving, analytical, communication, and collaboration skills

Preferred

Preferred experience with agentic AI, LangChain, and LangGraph

Benefits

Medical
Dental
Vision
Life
Disability
Other insurance plans
ESPP (employee stock purchase program)
401K program
HSA (Health Savings Account on the HDHP plan)
SupportLinc Employee Assistance Program (EAP) with up to 8 free counseling sessions
Corporate discount savings program
Certification prep
Library of technical and leadership courses/books/seminars
Certification discounts
Career Coach

Company

Jobs via Dice

twitter
company-logo
Welcome to Jobs via Dice, the go-to destination for discovering the tech jobs you want.

Funding

Current Stage
Early Stage
Company data provided by crunchbase