Apply on Employer Site

Intellum · 11 hours ago

Lead Site Reliability Engineer

United States

Full-time

Remote

Lead/Staff

10+ years exp

Intellum is seeking a Lead Site Reliability Engineer to spearhead their SRE team. In this role, you will drive operational maturity by defining reliability standards and enhancing the security posture while scaling the Intellum platform.

EducationEnterprise SoftwareHuman ResourcesInformation TechnologySaaS

H1B Sponsor Likely

Responsibilities

Set clear goals for the SRE team and partner with Engineering leadership to align platform initiatives with business objectives

Lead the definition and enforcement of SLAs, SLIs, and SLOs. Architect observability frameworks to translate telemetry data into actionable roadmaps that reduce toil and enhance resilience

Take ownership of critical code components (i.e., Queues, Enrollments) and lead efforts to identify bottlenecks, optimize performance, and improve code quality across the engineering department

Champion infrastructure security. Partner with InfoSec to define hardening standards, manage perimeter defense (WAF/DDoS), and automate vulnerability remediation within the CI/CD pipeline

Participate in the 24x7 on-call rotation and lead post-incident reviews (RCAs), ensuring action items are implemented to improve MTTR and prevent recurrence

Empower developers with better tooling and guidance on performant coding practices, fostering a culture of collaboration and reliability and 'you build it, you run it'

Qualification

Ruby on RailsCloud ComputingInfrastructure as CodeSQL DatabasesKubernetesDeep ObservabilitySecurity FocusCI/CD ExpertiseIncident ManagementProactive Problem-SolvingDocumentation & Training

Required

10+ years of engineering experience, with 5+ years specifically developing Ruby on Rails applications

Expertise in Cloud Computing (AWS/GCP) and Infrastructure as Code (Terraform/Ansible)

Strong proficiency with SQL databases (PostgreSQL) and the ability to quickly navigate and optimize complex, unfamiliar codebase

Proven experience designing monitoring solutions (Datadog, New Relic, Prometheus) based on the 'Golden Signals'

Demonstrated ability to define SLIs/SLOs from scratch, negotiate Error Budgets, and use data to balance feature velocity with reliability

Experience securing cloud environments and container platforms (Kubernetes), including hands-on management of WAF rules and edge security

Experience leading post-incident reviews (RCAs) and implementing action items that directly improve MTTR (Mean Time to Recovery) and MTTD (Mean Time to Detection)

Proven experience leading technical teams, mentoring engineers, and working in a team-oriented, collaborative environment with strong communication skills

Skilled in documenting solutions and training operational teams on how to effectively support and maintain systems

Demonstrated ability to communicate clearly, seek help proactively, and take ownership of tasks, leading them to completion

Bachelor's degree in Computer Science or related technical field

Preferred

Experience in developing solutions using server automation tools such as Terraform, Ansible

Experience in writing and maintaining CI/CD pipelines and services

Experience in building, deploying, and optimizing Kubernetes-based infrastructure

Experience configuring and managing Web Application Firewalls (WAF) (e.g., Cloudflare, AWS WAF, Akamai) and DDoS protection mechanisms

Company

Intellum

Intellum is a provider of integrated brand building, web presence strategy solutions and managed services company.

Founded in 2000

Atlanta, Georgia, USA

51-200 employees

http://www.intellum.com

H1B Sponsorship

Intellum has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)

Distribution of Different Job Fields Receiving Sponsorship

Represents job field similar to this job

Trends of Total Sponsorships

2022 (1)

Funding

Current Stage

Growth Stage

Total Funding

$25M

Key Investors

Guidepost Growth Equity

2023-08-02Private Equity· $25M

Leadership Team

David Pitta

Chief Marketing Officer

Greg Rose

Chief Experience Officer

Recent News

Morningstar.com

Intellum Announces 2025 Growth Makers Awards Winners

2025-09-19

Morningstar.com

Intellum Unveils the New Evolve: A Reimagined Future for Learning Content Creation

2025-09-09

PR Newswire

Intellum Now Available on Google Cloud Marketplace to Streamline Access to Scalable Learning Solutions

2025-06-13

Company data provided by crunchbase