SRE - Fiserv jobs in United States
cer-icon
Apply on Employer Site
company-logo

ShiftCode Analytics, Inc. ยท 4 months ago

SRE - Fiserv

ShiftCode Analytics, Inc. is looking for an SRE to support the day-to-day operations of the Digital group within the Card Services organization. The candidate will provide on-call service continuity and escalated support for various production applications while collaborating closely with development and technology groups.

AnalyticsConsultingInformation Technology
badNo H1Bnote

Responsibilities

Provide 24x7 support of production Internet applications on a rotating basis
Hands on understanding of Linux systems
Good understanding of cloud concepts
Point of escalation for application support to diagnose and resolve complex customer issues in accessing the Portal and Web Services environments
Drive Open Systems SEVerity crisis technical bridges and/or management bridges, as required and leverages experience and organizational knowledge to reduce MTTR
Review turnover paperwork to ensure that they are complete prior to production installs
Participate in the requirements gathering process, representing the production environments, to ensure that all operational aspects are identified and documented. Provide all tasks and detailed estimates to project managers, review and approve design documentation to ensure understanding of business logic changes and technical solution being implemented
Works with Change Management/ Release Managers to review propose change events for production
Work with FTS and Open System development to perform project code installations with assistance from the development and business groups. Validate successful implementations or fallbacks
Document install-defects and assign severity to the problems that occurred. After fallback, perform post mortem to identify root cause analysis (RCA)
Direct incident recovery, and cross-functional teams to collaborate on identified issues
Identify and implement improvements to incident recovery, incident engagement, and incident communications
Perform trending and analysis of problems; anticipate problems and develop risk mitigation plans
Participate in internal and external audits, as requires by management
Ensures monitoring alerts and system events are assessed, prioritized, and worked aggressively
Escalate issues to the technology, operations, and/or vendor(s) where appropriate
Ensure database/application controls and procedures remain compliant with Corporate IT risk
Support Disaster Recovery tests and live recovery for all production environments
Work with Card Services architects to validate and design enterprise solutions and application monitoring tools

Qualification

Cloud technologiesKubernetes platformsLinux systemsAzure Redhat OpenShiftAzure Kubernetes ServiceSoftware engineeringIncident recoveryRoot cause analysisCross-functional collaboration

Required

On prem & Cloud based environment
Cloud technologies a must
Good experience with Kubernetes platforms
Azure Redhat openshift
Azure Kubernetes Service
Any software engineering experience a plus
Linux systems experience needed
Mandatory rotating 24/7 production support responsibilities (every few weeks)
Experience acting as an escalation point to troubleshoot and resolve complex issues
Oversee production changes, installs, testing, and ensure compliance with IT and audit requirements
Incident recovery, perform root cause analysis, and improve response processes
Partner with developers, architects, and cross-functional teams to deliver solutions
Provide 24x7 support of production Internet applications on a rotating basis
Hands on understanding of Linux systems
Good understanding of cloud concepts
Point of escalation for application support to diagnose and resolve complex customer issues in accessing the Portal and Web Services environments
Drive Open Systems SEVerity crisis technical bridges and/or management bridges, as required and leverages experience and organizational knowledge to reduce MTTR
Review turnover paperwork to ensure that they are complete prior to production installs
Participate in the requirements gathering process, representing the production environments, to ensure that all operational aspects are identified and documented
Provide all tasks and detailed estimates to project managers, review and approve design documentation to ensure understanding of business logic changes and technical solution being implemented
Works with Change Management/ Release Managers to review propose change events for production
Work with FTS and Open System development to perform project code installations with assistance from the development and business groups
Validate successful implementations or fallbacks
Document install-defects and assign severity to the problems that occurred
After fallback, perform post mortem to identify root cause analysis (RCA)
Direct incident recovery, and cross-functional teams to collaborate on identified issues
Identify and implement improvements to incident recovery, incident engagement, and incident communications
Perform trending and analysis of problems; anticipate problems and develop risk mitigation plans
Participate in internal and external audits, as requires by management
Ensures monitoring alerts and system events are assessed, prioritized, and worked aggressively
Escalate issues to the technology, operations, and/or vendor(s) where appropriate
Ensure database/application controls and procedures remain compliant with Corporate IT risk
Support Disaster Recovery tests and live recovery for all production environments
Work with Card Services architects to validate and design enterprise solutions and application monitoring tools

Company

ShiftCode Analytics, Inc.

twittertwitter
company-logo
ShiftCode Analytics Inc is a Tampa, FL based firm formed with one sole purpose of delivering best and quick services to its clients nationwide.

Funding

Current Stage
Growth Stage
Company data provided by crunchbase