Apply on Employer Site

Blue Coding · 9 hours ago

Senior Site Reliability Engineer (V)

United States

Full-time

Remote

Senior Level

5+ years exp

Blue Coding specializes in hiring excellent developers and amazing people from all over Latin America and other parts of the world. They are seeking an experienced Site Reliability Engineer to help transform their client's Network Operations Center into a modern, hybrid SRE approach, defining reliability and building a high-performing reliability function from the ground up.

FinanceMedicalProfessional ServicesRetailSoftwareWeb Development

No H1B

Responsibilities

Build the foundation: Design and implement SRE best practices, processes, and tooling

Lead operational transformation: Help transition their NOC into a technically empowered, automation-driven reliability team

Own observability and monitoring: Drive improvements in system monitoring, alerting, and dashboards using tools like Grafana, CloudWatch, and Datadog

Automate everything: Reduce manual effort and increase resilience through Terraform, scripting, and cloud-native automation

Define and measure reliability: Establish SLIs, SLOs, and error budgets that keep the team accountable to high uptime and stability goals

Collaborate and mentor: Work closely with DevOps, SysOps, and engineering teams while helping upskill existing NOC engineers

Be a change agent: Bring a forward-looking mindset, driving cultural and technical change across the organization

Qualification

Site Reliability EngineeringAWSTerraformSecrets ManagementPythonMonitoring ToolsIncident ResponseCI/CDCollaborationMentorship

Required

5+ years in SRE, DevOps, or advanced systems engineering roles

Proven experience building or transforming SRE practices—you know what it takes to stand up a new function

Experience in creating and managing, reporting, and analyzing stability metrics

Strong AWS expertise

Strong experience with Secrets Management tooling like AWS Secrets Manager, HashiCorp Vault, Keeper, or Infisical strongly desired

Experience in the Atlassian tool platform (Jira, Confluence, Bitbucket) strongly desired

Hands-on experience with Terraform and infrastructure-as-code. This should include tools like Chef, Puppet, or Ansible

Strong programming proficiency—with emphasis on scripting and serverless automation (e.g., AWS Lambda). Experience developing tools and integrations using Python or equivalent modern languages is essential

Capacity to interact with development teams to understand automation and monitoring needs

Proficiency in Python and/or Ruby for automation and integrations

Expertise in monitoring, observability, incident response, and service reliability

Ability to define observability, incident response, and SLIs/SLOs

Excellent collaborator with a passion for mentorship and team growth

Preferred

AWS certifications are highly preferred

Experience with Ruby or .NET is a plus, supporting interoperability and legacy service integrations

Experience in insurance, fintech, or other regulated industries

Familiarity with incident.io, Jira Service Manager, or similar ITSM tools

Background in CI/CD pipelines and modern DevOps practices

Benefits

100% Remote

Company

Blue Coding

Top notch developers, ready to deploy.

Founded in 2014

Miami, Florida, USA

51-200 employees

https://www.bluecoding.com/

Funding

Current Stage

Growth Stage

Leadership Team

David Hemmat

Founder and CEO

Company data provided by crunchbase