Lead Site Reliability Engineering - Network jobs in United States

JPMorganChase · 17 hours ago

Lead Site Reliability Engineering - Network

JPMorgan Chase, one of the oldest financial institutions, offers innovative financial solutions to millions of consumers and businesses. As a Lead Site Reliability Engineer in the Network Product team, you will hold a leadership role, conducting resiliency design reviews and mentoring other engineers to ensure the reliability and stability of network services.

Asset Management, Banking, Financial Services
Growth Opportunities
H1B Sponsor Likely

Responsibilities

Demonstrates expertise in network reliability principles, including Permit to Operate, FMEA, and operational readiness, balancing new features, efficiency, and stability
Collaborates closely with network engineering teams (Datacenter, Firewall, Proxies, DMZ, Load Balancing, etc.) and Lines of Business to ensure alignment and optimal outcomes
Drives the adoption of network reliability best practices and robust observability across the organization, empirically demonstrating improvements through stability and reliability metrics
Acts as the bridge between Engineering, Operations, DevOps, and customers to build and maintain resilient, scalable, and secure network services
Provides Tier-3 network support, including operational support for major incidents, and ensures rapid resolution and root cause analysis
Fosters a culture of continual improvement, soliciting real-time feedback to enhance the customer and user experience
Ensures knowledge sharing and collaboration across teams, avoiding duplication of work and promoting innovation
Conducts blameless, data-driven post-mortems and regular team debriefs to enable learning from both successes and failures
Documents and shares knowledge, innovations, and best practices via internal forums, communities of practice, and industry conferences
Works with internal specialists, product, and engineering teams to package approaches, best practices, and lessons learned into thought leadership, methodologies, and published assets
Interacts with business, partners, and customer technical stakeholders to manage project scope, priorities, deliverables, risks and issues, and timelines for successful client outcomes
Demonstrates and champions site reliability culture and practices and exerts technical influence
Leads initiatives to improve the reliability and stability of your team’s applications and platforms using data-driven analytics to improve service levels
Collaborates with team members to identify comprehensive service level indicators, and with stakeholders to establish reasonable service level objectives and error budgets with customers
Demonstrates a high level of technical expertise within one or more technical domains and proactively identifies and solves technology-related bottlenecks in your areas of expertise
Acts as the main point of contact during major incidents for your infrastructure and demonstrates the skills to identify and solve issues quickly to avoid financial losses
Documents and shares knowledge within your organization via internal forums and communities of practice

Qualifications

Network reliability engineering, SD-WAN, Cloud platforms, Observability tools, Troubleshooting complex networks, Continuous integration tools, Networking protocols, Scalable networking design, Leadership, Mentoring, Communication, Problem-solving

Required

Advanced proficiency in network reliability engineering, including Permit to Operate, FMEA, and operational readiness processes
Experience leading technologists to manage and solve complex network issues at a firmwide level
Ability to influence team culture by championing innovation and change for success
Proficiency in SD-WAN, cloud platforms (AWS, Azure, etc.), and major network technologies (Palo Alto, Juniper, F5, Broadcom, Arista, Cisco, etc.)
Proficiency in observability and monitoring tools such as Grafana, SevOne, Prometheus, Kibana, ThousandEyes, and Splunk
Demonstrated proficiency in troubleshooting and supporting complex networking environments, including Tier-3 operational support for major incidents
Experience with continuous integration and delivery tools (e.g., Jenkins, GitLab, Terraform, etc.)
Formal training or certification in network engineering concepts and 5+ years of applied experience
10+ years of experience leading technologists to manage and solve complex technical items within your domain of expertise
Experience in scalable networking design, including high availability, redundancy, failover, and load balancing
Experience troubleshooting networking protocols such as TCP/IP, HTTPS, and BGP
Experience in customer-facing migration, including service discovery, assessment, planning, execution, and operations

Preferred

CCIE
Load-balancing
SD-WAN
Observability tools
eBPF
Cloud certs

Benefits

Comprehensive health care coverage
On-site health and wellness centers
A retirement savings plan
Backup childcare
Tuition reimbursement
Mental health support
Financial coaching

Company

JPMorganChase

With a history tracing its roots to 1799 in New York City, JPMorganChase is one of the world's oldest, largest, and best-known financial institutions—carrying forth the innovative spirit of our heritage firms in global operations across 100 markets.

H1B Sponsorship

JPMorganChase has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Trends of Total Sponsorships
2025 (3471)
2024 (3469)
2023 (3395)
2022 (3594)
2021 (2515)
2020 (2495)

Funding

Current Stage: Public Company
Total Funding: unknown
IPO: 1998-02-01

Leadership Team

Allison Beer
CEO of Card Services and Connected Commerce
Dan Mendelson
CEO, Morgan Health
Company data provided by crunchbase