Mattermost · 8 hours ago
Lead Site Reliability Engineer (SRE)
Mattermost is a collaborative workflow solution provider for critical infrastructure organizations. They are seeking a Lead Site Reliability Engineer (SRE) to guide the architecture and operational excellence of their secure collaboration platform while mentoring engineers and driving strategic initiatives for scalability and performance.
File SharingInformation ServicesInformation TechnologyMessagingSaaSSoftware
Responsibilities
Define the strategy, architecture, and roadmap for Mattermost’s site reliability engineering function, aligning infrastructure initiatives with product and business goals
Lead the design, deployment, and optimization of production-grade containerized workloads, infrastructure-as-code, and compliant cloud environments for regulated domains (e.g., FedRAMP, DoD)
Establish and evolve observability, monitoring, and alerting frameworks to ensure performance, reliability, and capacity planning at scale
Drive incident management processes, including on-call rotations, root cause analysis, and systemic reliability improvements
Partner with security and compliance teams to meet data sovereignty, security, and regulatory requirements
Champion automation and operational excellence to improve efficiency, reduce risk, and scale operations
Oversee cloud cost management and capacity planning to optimize infrastructure spending while meeting performance targets
Build and maintain a developer platform that enables fast, secure software delivery and improves application stability in production
Mentor and coach SRE team members, fostering a culture of learning, collaboration, and technical excellence
Qualification
Required
BS in Computer Science, Cybersecurity, Software Engineering, or a related technical field, or equivalent experience, with 5+ years of relevant experience in site reliability engineering, DevOps, or cloud infrastructure roles
Proven expertise in container orchestration platforms, ideally Kubernetes
Extensive experience with infrastructure-as-code, ideally Terraform
Strong background in cloud platforms, ideally AWS
Demonstrated experience designing and implementing monitoring, alerting, and performance optimization strategies
Exceptional troubleshooting and incident management skills for distributed systems
Proficiency in at least one scripting or programming language for automation
Excellent communication skills with a track record of influencing cross-functional teams
Experience leading globally distributed teams in a remote-first environment
For candidates residing in the U.S.: This role may require the ability to obtain and maintain a U.S. government security clearance in the future. As such, U.S. applicants must be U.S. citizens and eligible under applicable clearance requirements
Applicants must meet eligibility requirements for access to export-controlled information as defined by U.S. export control laws, including EAR and ITAR
Preferred
Familiarity with observability stacks such as Grafana and Prometheus
Experience designing high-availability, disaster recovery, and scaling architectures
Exposure to GCP and Azure cloud environments
Leadership experience in highly regulated industries such as defense, finance, or critical infrastructure
Experience with U.S. federal compliance frameworks and authorization processes, including FedRAMP, DoD ATO, NIST 800-53, and related government standards
Experience preparing, delivering, and maintaining software offerings through AWS Marketplace and other cloud provider marketplaces (e.g., Azure Marketplace, Google Cloud Marketplace), including packaging, compliance validation, and ongoing operational support
Open-source contributions in reliability, DevOps, or infrastructure tooling
Certifications in cloud infrastructure, reliability, or DevOps engineering (e.g., CKA, CKAD, AWS Certified Solutions Architect)
Company
Mattermost
Mattermost is an open source platform for secure collaboration across the entire software development lifecycle.
Funding
Current Stage
Growth StageTotal Funding
$73.5MKey Investors
Y CombinatorRedpointS28 Capital
2019-06-19Series B· $50M
2019-02-05Series A· $20M
2017-02-15Seed· $3.5M
Recent News
2025-09-19
Company data provided by crunchbase