Principal Site Reliability Engineer @ Zayo Group | Jobright.ai
JOBSarrow
RecommendedLiked
0
Applied
0
External
0
Principal Site Reliability Engineer jobs in United States
98 applicants
company-logo

Zayo Group · 14 hours ago

Principal Site Reliability Engineer

ftfMaximize your interview chances
Telecommunications
check
H1B Sponsor Likelynote

Insider Connection @Zayo Group

Discover valuable connections within the company who might provide insights and potential referrals.
Get 3x more responses when you reach out via email instead of LinkedIn.

Responsibilities

Develop and implement automation solutions to streamline operations and reduce manual effort.
Design and implement effective monitoring and alerting systems to proactively identify and address issues.
Own the incident lifecycle, from leading root cause analysis and resolution to implementing preventative measures to avoid future occurrences. Be on-call to diagnose and resolve critical service outages.
Proactively identify and mitigate potential system risks, focusing on automation, monitoring, and tooling to ensure high service availability.
Design and implement solutions to ensure our infrastructure can handle ever-growing demands while maintaining optimal application performance.
Work closely with developers, product managers, and other engineers to translate business needs into robust and reliable technical solutions. Become the beacon for best practices and efficient processes throughout the organization.

Qualification

Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.

Site Reliability EngineeringLinuxPythonAutomation toolsMonitoring systemsKubernetesDockerAWSGoogle CloudTCP/IPAnsibleTerraformSevOneAssure1NagiosPrometheusGrafanaCactiBGPDNSTLSHTTP/SPuppetNetwork APIsNetwork orchestration platformsCritical thinking skills

Required

Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience.)
Minimum of ten (10) years of experience in a Site Reliability Engineering or related role.
Strong understanding of system administration, Linux, and scripting languages (Python and various shells.)
Expert at developing automation tools for monitoring, alerting, and deployment to ensure efficient and reliable operations.
Expert at designing and implementing monitoring systems at scale.
Expert at container orchestration (Kubernetes and Docker.)
Experience with monitoring platforms such as SevOne, Assure1, and Nagios and various vendor NMS systems.
Previous work in large scale distributed production environments.
Experience with a variety of cloud platforms and tools (AWS, Google, etc.)
Experience with a variety of monitoring and alerting tools (Prometheus, Grafana, Cacti, etc.)
Strong working knowledge of networking concepts and application protocols, especially TCP/IP, BGP, DNS, TLS, and HTTP/S.
Experience with infrastructure management tools such as Ansible, Terraform, Puppet, to deploy and manage infrastructure at scale.
Proven leadership skills, with the ability to mentor and inspire others.
Excellent problem-solving, analytical, and critical thinking skills.
A passion for automation and building efficient systems.

Preferred

Experience working with various vendor APIs (or netconf) including Nokia, Juniper, Fujitsu, Infinera, Cisco, and Ciena.
Experience with various network orchestration platforms such as Ciena Blue Planet MDSO, Cisco NSO, Nokia NSP, or others.

Benefits

Excellent Health, Dental & Vision Insurance
Retirement 401(k) Savings Plan
Fitness membership discounts
Generous paid time off policy including paid parental leave

Company

Zayo Group

company-logo
For over 15 years, Zayo has been the driving force behind the world's most dynamic and forward-thinking enterprises, helping them pave the way to what's next.

H1B Sponsorship

Zayo Group has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2023 (2)
2022 (9)
2021 (7)
2020 (2)

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Laura Littman
Chief of Staff to the CEO
linkedin
leader-logo
Steve Smith
CEO
linkedin
Company data provided by crunchbase
logo

Orion

Your AI Copilot