Senior Site Reliability Engineer @ Avenue Code | Jobright.ai
JOBSarrow
RecommendedLiked
0
Applied
0
External
0
Senior Site Reliability Engineer jobs in United States
80 applicants
company-logo

Avenue Code · 5 hours ago

Senior Site Reliability Engineer

ftfMaximize your interview chances
ConsultingE-Commerce
check
Culture & Values
check
H1B Sponsor Likelynote

Insider Connection @Avenue Code

Discover valuable connections within the company who might provide insights and potential referrals.
Get 3x more responses when you reach out via email instead of LinkedIn.

Responsibilities

Contribute to overall change, incident and problem management in our environment with a focus on troubleshooting and fast restoration of our essential services and preventing future outages.
Participate in a once-a-month 24×7 on-call rotation and take leadership of severe incidents to help minimize impact.
Assist engineering teams by conducting truly blameless post-mortems with focused action items to drive continuous improvements.
Provide insights on trends of issues affecting reliability and partner in cross-functional projects to provide scalable solutions.
Review and advise on high-risk platform changes to minimize impact to the site and maximize success for stakeholders.
Work within a large distributed system based on Cloud Native services.
Maintain an automation-centric vision and incorporate SRE methodologies to increase reliability and decrease toil.
Create operating standards to help drive reliability at Credit Karma.
Experience with Site Reliability Engineering with a focus on Infrastructure, Platform, and Application (Cloud, Containerization, Container orchestration, Network, Application Reliability, Database Architecture) and an understanding of full stack and SDLC practices (Software Development Life Cycle) in DevOps or continuous release environment.
Experience in running critical incidents in a global or company-wide context, engaging with executives and senior leadership, and leading root cause analysis sessions.
Experience running and monitoring applications at scale, using metrics and tracing tools like, New Relic, Data Dog, Stackdriver, Zipkin, Prometheus, etc.
Professional experience with Python, Go, or similar programming languages.
Familiarity with SRE methodologies; passionate about solving operational challenges by using automation and software.
Ability to communicate effectively vertically and horizontally within the organization through demonstrating written and verbal communication skills.
Ability to drive troubleshooting through a pragmatic and collaborative approach.
Can construct clear and concise insights from data to promote and champion measurable improvements.
Experience working with Cloud Native services in a Public Cloud, e.g. Google Cloud Platform, AWS, Azure.

Qualification

Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.

Site Reliability EngineeringCloud Native servicesIncident managementPythonGoDatabase ArchitectureDevOps practicesCloud platformsAutomationNew RelicData DogStackdriverZipkinPrometheus

Required

Experience with Site Reliability Engineering with a focus on Infrastructure, Platform, and Application (Cloud, Containerization, Container orchestration, Network, Application Reliability, Database Architecture) and an understanding of full stack and SDLC practices (Software Development Life Cycle) in DevOps or continuous release environment.
Experience in running critical incidents in a global or company-wide context, engaging with executives and senior leadership, and leading root cause analysis sessions.
Experience running and monitoring applications at scale, using metrics and tracing tools like, New Relic, Data Dog, Stackdriver, Zipkin, Prometheus, etc.
Professional experience with Python, Go, or similar programming languages.
Familiarity with SRE methodologies; passionate about solving operational challenges by using automation and software.
Ability to communicate effectively vertically and horizontally within the organization through demonstrating written and verbal communication skills.
Ability to drive troubleshooting through a pragmatic and collaborative approach.
Can construct clear and concise insights from data to promote and champion measurable improvements.
Experience working with Cloud Native services in a Public Cloud, e.g. Google Cloud Platform, AWS, Azure.

Company

Avenue Code

company-logo
Avenue Code is an IT consulting firm that specializes on Agile ecommerce software development.

H1B Sponsorship

Avenue Code has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2023 (6)
2022 (10)
2021 (11)
2020 (14)

Funding

Current Stage
Late Stage
Total Funding
unknown
2022-11-09Acquired· undefined

Leadership Team

leader-logo
Zeo Solomon
Co-Founder and Chief Strategy Officer
linkedin
leader-logo
Amir Razmara
Managing Partner
linkedin
Company data provided by crunchbase
logo

Orion

Your AI Copilot