SIGN IN
Senior Site Reliability Engineer (V) jobs in United States
cer-icon
Apply on Employer Site
company-logo

Blue Coding · 9 hours ago

Senior Site Reliability Engineer (V)

Blue Coding specializes in hiring excellent developers and amazing people from all over Latin America and other parts of the world. They are seeking an experienced Site Reliability Engineer to help transform their client's Network Operations Center into a modern, hybrid SRE approach, defining reliability and building a high-performing reliability function from the ground up.
FinanceMedicalProfessional ServicesRetailSoftwareWeb Development
badNo H1Bnote

Responsibilities

Build the foundation: Design and implement SRE best practices, processes, and tooling
Lead operational transformation: Help transition their NOC into a technically empowered, automation-driven reliability team
Own observability and monitoring: Drive improvements in system monitoring, alerting, and dashboards using tools like Grafana, CloudWatch, and Datadog
Automate everything: Reduce manual effort and increase resilience through Terraform, scripting, and cloud-native automation
Define and measure reliability: Establish SLIs, SLOs, and error budgets that keep the team accountable to high uptime and stability goals
Collaborate and mentor: Work closely with DevOps, SysOps, and engineering teams while helping upskill existing NOC engineers
Be a change agent: Bring a forward-looking mindset, driving cultural and technical change across the organization

Qualification

Site Reliability EngineeringAWSTerraformSecrets ManagementPythonMonitoring ToolsIncident ResponseCI/CDCollaborationMentorship

Required

5+ years in SRE, DevOps, or advanced systems engineering roles
Proven experience building or transforming SRE practices—you know what it takes to stand up a new function
Experience in creating and managing, reporting, and analyzing stability metrics
Strong AWS expertise
Strong experience with Secrets Management tooling like AWS Secrets Manager, HashiCorp Vault, Keeper, or Infisical strongly desired
Experience in the Atlassian tool platform (Jira, Confluence, Bitbucket) strongly desired
Hands-on experience with Terraform and infrastructure-as-code. This should include tools like Chef, Puppet, or Ansible
Strong programming proficiency—with emphasis on scripting and serverless automation (e.g., AWS Lambda). Experience developing tools and integrations using Python or equivalent modern languages is essential
Capacity to interact with development teams to understand automation and monitoring needs
Proficiency in Python and/or Ruby for automation and integrations
Expertise in monitoring, observability, incident response, and service reliability
Ability to define observability, incident response, and SLIs/SLOs
Excellent collaborator with a passion for mentorship and team growth

Preferred

AWS certifications are highly preferred
Experience with Ruby or .NET is a plus, supporting interoperability and legacy service integrations
Experience in insurance, fintech, or other regulated industries
Familiarity with incident.io, Jira Service Manager, or similar ITSM tools
Background in CI/CD pipelines and modern DevOps practices

Benefits

100% Remote

Company

Blue Coding

twittertwittertwitter
company-logo
Top notch developers, ready to deploy.

Funding

Current Stage
Growth Stage

Leadership Team

leader-logo
David Hemmat
Founder and CEO
linkedin
Company data provided by crunchbase