Sr. Staff Software Engineer - HPC Network Engineering jobs in United States

LinkedIn · 15 hours ago

Sr. Staff Software Engineer - HPC Network Engineering

LinkedIn is the world’s largest professional network, built to create economic opportunity for every member of the global workforce. We are seeking an HPC Network Engineer to design, deploy, and operate high-performance, low-latency Ethernet fabrics for large-scale GPU clusters, focusing on RoCE v2-based GPU interconnect networks supporting AI/ML training and inference workloads.

Professional Networking, Recruiting, Social Media, Social Recruiting
Comp. & Benefits
H1B Sponsor Likely

Responsibilities

Lead network architecture and design for large-scale LLM training and inference workloads
Design RoCE v2–based GPU interconnection fabrics for multi-rack and multi-pod GPU clusters
Define lossless Ethernet architectures (Clos / fat-tree / leaf-spine) optimized for RDMA
Select and validate 400G / 800G Ethernet switching platforms and NICs (ConnectX, BlueField, etc.)
Apply deep expertise in host-level and Kubernetes pod networking architectures, including enabling high-performance features such as RDMA and GPUDirect
Tune host network performance for large-scale collective communications, balancing latency, throughput, and congestion control
Analyze system performance and diagnose complex cross-layer issues

Qualifications

HPC Networking, Distributed Systems, Performance Optimization, Technical Leadership, Linux System Engineering, Network Protocols, Container Platforms, Data Pipeline Architectures, Go, Python, C++, Kubernetes

Required

BA/BS Degree in Computer Science or related technical discipline, or equivalent practical experience
10+ years of experience building and operating large-scale distributed systems or data-intensive backend platforms
Experience in one or more programming languages such as Go, Python, C++, or similar
Experience in Linux system engineering and host networking
Demonstrated knowledge of network protocols, fabric design, and performance optimization
Proven ability to lead complex technical initiatives end-to-end in a multi-team environment
Strong system design skills with a focus on scalability, reliability, and performance
Experience with container platforms (Kubernetes) and microservices

Preferred

Experience supporting large-scale AI or HPC workloads
Familiarity with LLM training frameworks and communication libraries (e.g., NCCL, MPI)
Experience with streaming systems (Kafka, Flink, Spark Streaming, or similar) and high-throughput data pipeline architectures
Experience with performance benchmarking and profiling tools
Experience with infrastructure automation or configuration management tools
Demonstrated influence across organizations (tech lead, architect, principal/IC leadership roles)

Benefits

Annual performance bonus
Stock
Benefits and/or other applicable incentive compensation plans
Generous health and wellness programs
Time away for employees of all levels

Company

LinkedIn

LinkedIn is a professional networking site that allows users to create business connections, search for jobs, and find potential clients. It is a subsidiary of Microsoft.

H1B Sponsorship

LinkedIn has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is presented below for reference. (Data powered by the US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship (chart; the highlighted segment represents job fields similar to this job)
Trends of Total Sponsorships
2025 (892)
2024 (1108)
2023 (913)
2022 (1580)
2021 (1043)
2020 (1146)

Funding

Current Stage: Public Company
Total Funding: $154.8M
Key Investors: Bain Capital Ventures, Greylock, Sequoia Capital
2016-06-13: Acquired
2016-02-15: Private Equity
2014-04-01: Series Unknown

Leadership Team

Ryan Roslansky, Chief Executive Officer
Dan Shapero, Chief Operating Officer
Company data provided by Crunchbase