Strategic Staffing Solutions · 6 hours ago
Senior Site Reliability Engineer
Responsibilities
Write and develop code to automate processes, such as analyzing logs, testing production environments, and responding to issues
Collaborate with agile teams and business partners to develop specifications that address problems and enhancement needs, with a focus on monitoring and metrics for operational readiness
Identify bottlenecks in development and deployment processes and design automation solutions to mitigate them
Develop new capabilities for displaying, monitoring, and alerting on key performance indicators by tracking business transactions in real time
Maintain and grow knowledge of platform configuration management, monitoring of established metrics, and troubleshooting
Provide continuous feedback to development teams on system stability, defect analysis, and system enhancements
Design and develop alert escalation and incident response automation
Provide production support for cloud service outages and incidents and work on both tactical and strategic plans for outage prevention
Provide feedback on resiliency and maintainability of solutions to Cloud and App architects
Conduct disaster recovery scenario generation and testing
Implement sustainable, audit-ready processes that support information technology controls, including deployment execution, access management, audits, incident management and related requirements
Qualifications
Required
5+ years of hands-on experience as an Azure site reliability engineer. Experience must be focused on Azure; other cloud platforms are not relevant.
5+ years of extensive experience implementing the four golden signals (latency, traffic, errors, and saturation) in Azure
Monitoring and observability experience with Azure-native tools such as Azure Monitor and Application Insights, using Terraform. Must be very strong in Terraform.
At least 5 years’ experience as a site reliability engineer on a cross-functional agile team working in Azure
Working knowledge of agile development methodologies (Scrum, sprints, Kanban, etc.) and tools (Azure DevOps, etc.)
At least 5 years of hands-on experience using IaC tools: Terraform, GitHub, Ansible and Packer. Expert-level Terraform experience required.
Proven experience across testing, integration, source code management, deployment and containerization
Sound problem-solving skills with the ability to quickly process complex information and present it clearly and simply
Experience with cloud technologies and services including those for Compute, Storage, Databases and API Management
On-premises to cloud migration experience
Company
Strategic Staffing Solutions
Strategic Staffing Solutions is a recruiting company that helps companies in various industries find suitable employees.
Funding
Current Stage
Late Stage
Company data provided by Crunchbase