Senior SRE jobs in United States

Blackpoint Cyber · 10 hours ago

Senior SRE

Blackpoint Cyber is the leading provider of world-class cybersecurity threat hunting, detection and remediation technology. The Senior Site Reliability Engineer will be responsible for designing, implementing, and maintaining cloud and on-premises infrastructure, focusing on automation, scalability, and performance while collaborating with cross-functional teams to ensure system reliability.

Computer, Cyber Security, Network Security, Software
Comp. & Benefits
H1B Sponsor Likely

Responsibilities

Infrastructure Management & Automation: Design, develop, and maintain highly scalable infrastructure utilizing Infrastructure as Code (IaC) methodologies, with primary focus on Terraform and Terragrunt for automated cloud resource provisioning and orchestration
Cloud Platform Administration: Oversee and optimize cloud environments, with a specialized focus on Amazon Web Services (AWS), ensuring adherence to cost optimization strategies, security best practices, and high-availability standards
Container Orchestration & Continuous Delivery: Manage and optimize Kubernetes cluster environments utilizing Helm, ArgoCD, Istio, and Kustomize to support continuous delivery pipelines and infrastructure-as-code practices
Data Streaming Platform Operations: Administer and scale data streaming infrastructure using Confluent Cloud and Apache Kafka to support enterprise-level data processing requirements
Caching & Real-Time Data Solutions: Deploy, configure, and maintain Redis instances to facilitate caching mechanisms and real-time data processing capabilities
Observability & Incident Management: Implement and maintain comprehensive monitoring, alerting, and incident response frameworks utilizing Prometheus, Grafana, Alertmanager, and OpsGenie/PagerDuty to ensure optimal system reliability and performance
Feature Management & Release Engineering: Facilitate controlled feature deployments and progressive rollouts through LaunchDarkly/PostHog platform integration and management
Cross-Functional Collaboration: Partner with software development teams to ensure seamless integration of new services, applications, and features into existing infrastructure ecosystems
Technical Issue Resolution: Diagnose and resolve complex system-level issues, implementing solutions that maintain high performance standards and maximize system uptime
Process Optimization & Enhancement: Drive continuous improvement initiatives for automation tools, operational processes, and engineering methodologies to enhance system scalability, reliability, and maintainability
Technical Innovation & Knowledge Management: Maintain current knowledge of emerging Site Reliability Engineering trends, tools, and technologies, ensuring organizational adoption of relevant industry advancements and best practices

Qualifications

Terraform, AWS, Kubernetes, Apache Kafka, Redis, Prometheus, Grafana, LaunchDarkly, CI/CD, Problem-solving, Communication, Collaboration

Required

Professional Experience: Minimum of eight (8) years of demonstrated experience in a Senior Site Reliability Engineer role or equivalent position, with substantial emphasis on cloud infrastructure management and automation technologies
Infrastructure as Code Proficiency: Expertise in Infrastructure as Code (IaC) methodologies, specifically utilizing Terraform and Terragrunt for enterprise-scale deployments
Cloud Architecture Expertise: Comprehensive knowledge of Amazon Web Services (AWS) cloud platform, including demonstrated proficiency in designing, implementing, and maintaining secure, scalable, and resilient cloud architectures aligned with industry best practices
Distributed Streaming Systems: Extensive hands-on experience architecting and managing distributed data streaming solutions utilizing Confluent Cloud and Apache Kafka platforms
Data Storage & Caching Technologies: Proven experience implementing and managing Redis for high-performance caching solutions and Amazon RDS for relational database management
Search & Analytics Platforms: Proven experience with enterprise search and analytics solutions, including OpenSearch, Elasticsearch, and ChaosSearch platforms
Observability & Monitoring Systems: Proficiency in designing and implementing comprehensive monitoring and alerting infrastructures utilizing Prometheus, Grafana, Alertmanager, and OpsGenie/PagerDuty
Feature Management Platforms: Practical experience implementing and managing feature flag systems using LaunchDarkly/PostHog for controlled release management
Container Orchestration Expertise: Extensive experience administering production-grade Kubernetes clusters, including package management via Helm, continuous deployment using ArgoCD, and service mesh implementation with Istio
Configuration Management: Working knowledge of Kustomize for Kubernetes resource configuration and customization
Excellent problem-solving skills with the ability to troubleshoot complex systems in production
Strong communication and collaboration skills, with experience working in agile environments

Preferred

Multi-Cloud Platform Experience: Demonstrated experience architecting and managing infrastructure across multiple cloud service providers, including Google Cloud Platform (GCP) and Microsoft Azure
Security & Compliance Expertise: Comprehensive understanding of security frameworks, compliance standards, and best practices applicable to cloud-native and containerized infrastructure environments
Serverless & CI/CD Proficiency: Working knowledge of serverless computing architectures and continuous integration/continuous deployment (CI/CD) pipelines, including practical experience with Jenkins and GitHub Actions platforms
Software Development Capabilities: Technical proficiency in software development using Node.js, Python, and/or Go programming languages

Benefits

Health, Vision, Dental, and Life Insurance plans
Robust 401k plan
Discretionary Time Off
Other minor perks

Company

Blackpoint Cyber

Blackpoint Cyber is a provider of cybersecurity threat hunting, detection, and response technology.

H1B Sponsorship

Blackpoint Cyber has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2025 (1)
2024 (2)

Funding

Current Stage
Growth Stage
Total Funding
$201.4M
Key Investors
Bain Capital Tech Opportunities, WP Global Partners
2023-06-08 · Series C · $190M
2022-05-20 · Series B
2020-07-08 · Series B · $5.4M

Leadership Team

Jon Murchison
Founder and Executive Chairman
Manoj Srivastava
Chief Technology and Product Officer
Company data provided by Crunchbase