Microsoft · 2 days ago
Site Reliability Engineer
Wonder how qualified you are to the job?
Data ManagementDeveloper Tools
Insider Connection @Microsoft
Responsibilities
Participate in on-call rotations and incident responses throughout product development and operations cycles. On-call will require responding to support requests after normal business hours to include the weekends and/or holidays in a designated Microsoft office.
Monitor system performance and proactively identify and resolve issues to ensure high availability and performance.
Develop and maintain automation tools and processes for deployment, monitoring, and configuration management.
Apply troubleshooting skills, debugging tools, and examine logs, telemetry, and other methods to verify assumptions and customer impact. Proactively and reactively address findings with customer and/or service engineering efficiently via written and verbal communications.
Lead blameless postmortems for root cause and production resiliency.
Consult with developers to design services that scale in Azure.
Stay current with industry trends, emerging technologies, and best practices in site reliability engineering and cloud computing.
Qualification
Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.
Required
4+ years technical experience in software engineering, network engineering, or systems administration
OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 1+ year(s) technical experience in software engineering, network engineering, or systems administration
OR Master's Degree in Computer Science, Information Technology, or related field
Preferred
5+ years technical experience in software engineering, network engineering, or systems administration
OR Bachelor's Degree in computer science, information technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration
OR Master's Degree in computer science, information technology, or related field AND 1+ years technical experience in software engineering, network engineering, or systems administration
Experience applying SRE principles in a large production environment
Demonstrated proficiency in cloud computing platforms (e.g., AWS, Azure, GCP) and related services (e.g., EC2, S3, VPC, IAM, Lambda)
Expertise in automation tools and frameworks (e.g., Terraform, Ansible, Chef, Puppet) and scripting languages (e.g., Python, Bash)
Deep understanding of containerization and orchestration technologies (e.g., Docker, Kubernetes)
Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack) and incident response processes
Demonstrated problem-solving skills and the ability to troubleshoot complex issues in distributed systems
Effective communication and collaboration skills, with the ability to work effectively in a cross-functional team environment
Benefits
Dental Insurance
Vision Insurance
Company
Microsoft
Microsoft is a software corporation that develops, manufactures, licenses, supports, and sells a range of software products and services.
Funding
Current Stage
Public CompanyTotal Funding
$1MKey Investors
Technology Venture Investors
2024-01-02Undisclosed· Undisclosed
2022-12-09Post Ipo Equity· Undisclosed
1986-03-13IPO· nasdaq:MSFT
Leadership Team
Recent News
Digital Trends
2024-06-06
2024-06-06
2024-06-06
Company data provided by crunchbase