Lumen Technologies · 9 hours ago
Cloud Site Reliability Engineer
Maximize your interview chances
Big DataInformation Services
Actively Hiring
Insider Connection @Lumen Technologies
Get 3x more responses when you reach out via email instead of LinkedIn.
Responsibilities
Architect, build, and maintain highly available, fault-tolerant systems using multiple cloud service providers.
Design and manage cloud infrastructure focusing on networking, scalability, security, and reliability.
Troubleshoot complex, cross-system issues involving cloud infrastructure, databases, and networking.
Implement and maintain guardrails to ensure consistent and secure operation of cloud workloads.
Lean in on automation of deployment pipelines (CI/CD), infrastructure provisioning, and scaling to ensure seamless performance of new deployments, focusing on GitOps principles.
Define infrastructure as code, enabling scalable, repeatable, and secure deployments.
Set up and enforce guardrails for databases, infrastructure, and applications, ensuring consistency and adherence to best practices.
Implement robust application and infrastructure monitoring using tools like Prometheus, Grafana, and potentially Datadog.
Documentation of standardized operational workflows to ensure adherence to requirements and dissemination best practices for production readiness and operationalization
Coach and mentor others in the business processes implemented in the team's applications, services, and workflows to provide resolution to support problems
Effectively manage and deliver multiple initiatives in parallel while maintaining ownership of key areas of responsibility across the organization
Indirectly influence the work of others to drive production readiness and operationalization of system and service components
Effectively estimate the time it will take to perform tasks and deliver or influence the work to be completed within those timeframes
Scale solutions to meet enhanced requirements for growth, new services, or network optimization by applying approved technologies and architectures for enhancements or additions
Qualification
Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.
Required
Total 10+ years of experience with bachelor's degree in Computer Science, Engineering, or a related field.
5+ years of experience in site reliability engineering or a related field.
Deep expertise in cloud platforms, particularly AWS, Azure, and GCP.
Proficiency with automation tools such as Terraform, ArgoCD, and GitHub Actions.
Experience with Infrastructure as Code (IaC) tools (Terraform, CloudFormation)
Strong troubleshooting skills for complex systems involving cloud infrastructure, databases, and networking.
Experience with monitoring tools like Prometheus, Grafana, and Datadog.
Preferred
Excellent communication and collaboration skills.
Security best practices for cloud environments.
Benefits
Health, Life, Voluntary Lifestyle and other benefits and perks that enhance your physical, mental, emotional and financial wellbeing.
Company
Lumen Technologies
Lumen delivers the most secure platform for applications and data to help businesses, government and communities deliver amazing experiences
Funding
Current Stage
Public CompanyTotal Funding
$10.4M2023-05-22Post Ipo Equity
2020-01-31Post Ipo Debt
2018-06-21Post Ipo Equity· $2.4M
Recent News
The Motley Fool
2024-12-16
2024-12-11
2024-12-11
Company data provided by crunchbase