Manager, Reliability Engineering @ myGwork - LGBTQ+ Business Community | Jobright.ai
JOBSarrow
RecommendedLiked
0
Applied
0
Manager, Reliability Engineering jobs in Bellevue, WA
Be an early applicantLess than 25 applicants
company-logo

myGwork - LGBTQ+ Business Community · 2 days ago

Manager, Reliability Engineering

Wonder how qualified you are to the job?

ftfMaximize your interview chances
Internet

Insider Connection @myGwork - LGBTQ+ Business Community

Discover valuable connections within the company who might provide insights and potential referrals, giving your job application an inside edge.

Responsibilities

Define, manage, and measure incident response engineering practices
Liaise with engineering teams to ensure work discovered during incident response is prioritized
Participate in incident response engineering duties as necessary
Manage a global Reliability Operations team (3 to 6+ Reliability operations engineers across NAMER, EMEA, APAC) with periodic weekend coverage requirements
Adaptive management style according to level and proficiency of engineering reports
Ability to understand technical employee career paths and collaboratively develop career plans
Scheduling a global team through holidays, sickness and vacation leaves, across timezones
Understanding of large-scale distributed system architectures (e.g., databases, web services, application services)
Familiarity with monitoring tools (e.g., Prometheus, Grafana, Nagios)
Ability to author scripts to facilitate troubleshooting as well as configure alerts
Ability to prioritize and manage incidents based on severity, with a focus on customer impact
Ability to remain calm under pressure and quickly diagnose issues
Understanding of system logs, metrics, telemetry
Ability to take command and confidently direct engineering resources in ambiguous situations
Ability to communicate effectively with stakeholders during an incident
Ability to maintain and update trouble-shooting guides (TSGs) and operational documentation

Qualification

Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.

Technical TroubleshootingLarge-Scale System ArchitecturesMonitoring ToolsScriptingIncident ManagementCalm under PressureSystem LogsResource ManagementDocumentation ManagementProblem-SolvingHard WorkingCommunicationLeadershipPythonBash

Required

Bachelor’s Degree from a four-year university or relevant substitute experience
6+ years relevant work experience in Technical and/or Application Support with strong knowledge technical troubleshooting
2-5 years of management experience with direct reports
Understanding of large-scale distributed system architectures (e.g., databases, web services, application services)
Familiarity with monitoring tools (e.g., Prometheus, Grafana, Nagios)
Ability to author scripts to facilitate troubleshooting as well as configure alerts
Ability to prioritize and manage incidents based on severity, with a focus on customer impact
Ability to remain calm under pressure and quickly diagnose issues
Understanding of system logs, metrics, telemetry
Ability to take command and confidently direct engineering resources in ambiguous situations
Ability to communicate effectively with stakeholders during an incident
Ability to maintain and update trouble-shooting guides (TSGs) and operational documentation

Preferred

Proficiency in scripting languages (e.g., Python, Bash) is a plus
Periodically meeting with reports across timezones
There may be periodic weekend coverage requirements
Adaptive management style according to level and proficiency of engineering reports
Ability to understand technical employee career paths and collaboratively develop career plans
Scheduling a global team through holidays, sickness and vacation leaves, across timezones

Benefits

Comprehensive healthcare (medical, dental, and vision) with premiums paid in full for employees and dependents
Retirement benefits such as a 401k plan and company match
Short and long-term disability coverage
Basic life insurance
Well-being benefits
Reimbursement for certain tuition expenses
Parental leave
Sick time of 1 hour per 30 hours worked
Vacation time for full-time employees up to 120 hours thru the first year and 160 hours thereafter
Around 13 paid holidays per year
Employee Stock Purchase Plan

Company

myGwork - LGBTQ+ Business Community

twittertwittertwitter
company-logo
myGwork is the largest global platform for the LGBTQ+ business community.

Funding

Current Stage
Early Stage
Total Funding
$4.77M
Key Investors
24 HaymarketInnovate UK
2023-08-17Series Unknown· $1.66M
2023-08-17Grant· Undisclosed
2021-12-07Series A· $2.12M

Leadership Team

leader-logo
Adrien Gaubert
Co-Founder & CMO
linkedin
Company data provided by crunchbase
logo

Orion

Your AI Copilot