Brilliant® · 16 hours ago
Software Engineer - Reliability Engineering
Brilliant® is a company focused on creating intelligent automation for incident detection and response. They are seeking a Senior Site Reliability Engineer who will work on developing automation, designing internal tools for service health, and applying AI to enhance operational workflows.
Staffing & Recruiting
Responsibilities
Develop automation that minimizes manual operational effort and improves team effectiveness
Design and maintain internal tooling that provides visibility into service health and reliability trends
Rethink post-incident analysis by transforming learnings into proactive safeguards
Evaluate and implement modern techniques for observability, telemetry, and alerting
Apply deep engineering expertise to diagnose and resolve complex production issues
Investigate how machine learning and AI can enhance signal detection, prioritization, and response workflows
Collaborate with engineering partners to examine incidents and address systemic reliability gaps
Lead technical discussions that influence reliability architecture and operational standards
Translate recurring operational challenges into scalable engineering solutions
Help define and evolve best practices for incident response and operational excellence
Qualification
Required
Experience working with compiled languages (such as Java, C#, or Go) and scripting or dynamic languages (such as Python, Ruby, or JavaScript), with a solid understanding of when to use each
Strong background in distributed systems, including familiarity with common failure scenarios
Hands-on experience creating internal platforms, automation frameworks, or developer-facing tools
Proficiency with version control systems and continuous integration / deployment workflows
Experience managing infrastructure through code and designing service APIs
Demonstrated ability to reduce operational overhead through thoughtful automation
Ownership of production systems, including participation in on-call rotations and incident response
Systems-oriented thinker who considers interactions and dependencies at scale
Comfortable investigating ambiguous problems and contributing solutions in collaborative settings
Receptive to feedback and able to synthesize multiple perspectives into practical outcomes
Clear technical communicator, capable of producing both detailed documentation and architectural diagrams
Detail-oriented with strong analytical problem-solving skills
Preferred
chaos testing
lean or agile methodologies
open-source involvement
public speaking experience
Benefits
Healthcare
PTO
401k
Company
Brilliant®
About Brilliant® Brilliant is an award-winning staffing and recruiting firm that provides direct-hire and contract staffing services in accounting, finance, technology, and business operations serving businesses across the continental U.S.