Lead Site Reliability Engineering - Network jobs in United States

Chase · 16 hours ago

Lead Site Reliability Engineering - Network

JPMorgan Chase, one of the oldest financial institutions, offers innovative financial solutions and is seeking a Lead Site Reliability Engineer within the Network Product. This role involves leading technical efforts in network reliability, collaborating with various teams to ensure optimal outcomes, and acting as a technical lead while mentoring other engineers.

Banking, Financial Services

Responsibilities

Demonstrates expertise in network reliability principles, including Permit to Operate, FMEA, and operational readiness, balancing new features, efficiency, and stability
Collaborates closely with network engineering teams (Datacenter, Firewall, Proxies, DMZ, Load Balancing, etc.) and Lines of Business to ensure alignment and optimal outcomes
Drives the adoption of network reliability best practices and robust observability across the organization, empirically demonstrating improvements through stability and reliability metrics
Acts as the bridge between Engineering, Operations, DevOps, and customers to build and maintain resilient, scalable, and secure network services
Provides Tier-3 network support, including operational support for major incidents, ensuring rapid resolution and root cause analysis
Fosters a culture of continual improvement, soliciting real-time feedback to enhance the customer and user experience
Ensures knowledge sharing and collaboration across teams, avoiding duplication of work and promoting innovation
Conducts blameless, data-driven post-mortems and regular team debriefs to enable learning from both successes and failures
Documents and shares knowledge, innovations, and best practices via internal forums, communities of practice, and industry conferences
Works with internal specialists, product, and engineering teams to package approaches, best practices, and lessons learned into thought leadership, methodologies, and published assets
Interacts with business, partners, and customer technical stakeholders to manage project scope, priorities, deliverables, risks and issues, and timelines for successful client outcomes
Demonstrates and champions site reliability culture and practices and exerts technical influence
Leads initiatives to improve the reliability and stability of your team’s applications and platforms using data-driven analytics to improve service levels
Collaborates with team members to identify comprehensive service level indicators, and with stakeholders to establish reasonable service level objectives and error budgets with customers
Demonstrates a high level of technical expertise within one or more technical domains and proactively identifies and solves technology-related bottlenecks in your areas of expertise
Acts as the main point of contact during major incidents for your infrastructure and demonstrates the skills to identify and solve issues quickly to avoid financial losses
Documents and shares knowledge within your organization via internal forums and communities of practice

Qualifications

Network reliability engineering, SD-WAN, Cloud platforms, Observability tools, Troubleshooting complex networks, Continuous integration tools, Networking protocols, Leadership, Mentoring, Communication

Required

Advanced proficiency in network reliability engineering, including Permit to Operate, FMEA, and operational readiness processes
Experience leading technologists to manage and solve complex network issues at a firmwide level
Ability to influence team culture by championing innovation and change for success
Proficiency in SD-WAN, cloud platforms (AWS, Azure, etc.), and major network technologies (Palo Alto, Juniper, F5, Broadcom, Arista, Cisco, etc.)
Proficiency in observability and monitoring tools such as Grafana, SevOne, Prometheus, Kibana, ThousandEyes, and Splunk
Demonstrated proficiency in troubleshooting and supporting complex networking environments, including Tier-3 operational support for major incidents
Experience with continuous integration and delivery tools (e.g., Jenkins, GitLab, Terraform, etc.)
Formal training or certification in network engineering concepts and 5+ years of applied experience
10+ years of experience leading technologists to manage and solve complex technical items within your domain of expertise
Experience in scalable networking design, including high availability, redundancy, failover, and load balancing
Experience troubleshooting networking protocols such as TCP/IP, HTTPS, and BGP
Experience in customer-facing migration, including service discovery, assessment, planning, execution, and operations

Preferred

CCIE
Load-balancing
SD-WAN
Observability tools
eBPF
Cloud certifications

Benefits

Comprehensive health care coverage
On-site health and wellness centers
A retirement savings plan
Backup childcare
Tuition reimbursement
Mental health support
Financial coaching

Company

Chase provides a broad range of financial services. It is a sub-organization of JPMorgan Chase.

Funding

Current Stage
Late Stage

Leadership Team

Mike McDonnell
Managing Director, Head of Chase Travel Platform Product
Nicole Sanchez
Managing Director, Consumer Bank, GM and Product Executive, Growth Financial Products
Company data provided by Crunchbase