NVIDIA · 1 week ago
Senior Network Software Engineer
Wonder how qualified you are to the job?
Insider Connection @NVIDIA
Responsibilities
Collaborate with multi-functional teams to analyze, co-design, and develop networking software and hardware for innovative AI platforms.
Drive the development of new networking algorithms and protocols for point-to-point and collective operations at scale.
Identify bottlenecks and inefficiencies in application code, proposing optimizations to enhance performance and network utilization.
Design and implement performance benchmarks and testing methodologies to evaluate performance at scale.
Provide guidance and recommendations for optimizing AI applications for speed, scalability, and resource efficiency.
Share knowledge with domain expert teams as they develop applications for the next generation of AI platforms.
Contribute to the development of tools and frameworks to facilitate network optimization.
Qualification
Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.
Required
PhD in Computer Science, Computer Engineering, or related field, or equivalent experience
10+ years of experience with a focus on high-performance networking and AI applications
Expertise in RDMA networking (InfiniBand, ROCE), Ethernet, and PCIe
Experience with at least one high-performance networking library: NCCL, UCX, libfabric, MPI, UCC
Deep understanding of various aspects of high-performance networking, including network technologies, debugging, and performance analysis
Experience in developing and optimizing deep learning frameworks such as PyTorch and TensorFlow
Proficiency in Python and C/C++
Experience in CUDA programming
Track record of delivering performance improvements for software used in large-scale deployments
Preferred
Knowledge of Kubernetes (k8s) and cloud-native application principles is a plus
Familiarity with continuous integration and delivery practices for performance optimization
Hands-on experience in optimizing networking building blocks for DL frameworks like PyTorch and TensorFlow
Experience in developing communication libraries such as NCCL, UCX, UCC, MPI
In-depth knowledge of RDMA, GPU-Direct, and network technologies
Provide references to your code contributions
Benefits
Equity
Company
NVIDIA
NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.
H1B Sponsorship
NVIDIA has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Trends of Total Sponsorships
2023 (735)
2022 (892)
2021 (696)
2020 (534)
Funding
Current Stage
Public CompanyTotal Funding
$4.09BKey Investors
ARPA-EARK Investment ManagementSoftBank Vision Fund
2023-05-09Grant· $5M
2022-08-09Post Ipo Equity· $65M
2017-05-24Post Ipo Equity· $4B
Recent News
2024-06-06
2024-06-06
2024-06-06
Company data provided by crunchbase