NVIDIA · 3 months ago
Senior Storage and Networking Product Engineer
NVIDIA is a pioneer in AI, ML, and HPC technologies, seeking a Senior Storage and Networking Product Engineer to enhance their high-performance infrastructure. This role focuses on ensuring the optimal operation of compute platforms by integrating advanced storage systems and networking technologies, while also emphasizing low latency, efficiency, and scalability.
AI InfrastructureArtificial Intelligence (AI)Consumer ElectronicsFoundational AIGPUHardwareSoftwareVirtual Reality
Responsibilities
Architect, deploy, and maintain distributed storage clusters with a focus on scalable performance and data durability
Develop and improve high-performance networking architectures for storage environments, ensuring low-latency data paths for AI/ML and HPC workloads
Configure and tune RDMA, NVMe-over-Fabrics, RoCE, InfiniBand, and Ethernet-based fabrics for maximum performance
Partner with GPU, networking, and systems teams to ensure seamless end-to-end performance across the full stack
Develop automated systems for monitoring, recording, and notifying in storage and networking
Build and maintain capacity planning models for network efficiency and storage growth
Troubleshoot complex network-storage interactions, including bottlenecks in distributed filesystems, parallel storage, and interconnects
Implement data protection and compliance controls such as encryption in-transit, access control, and auditing. and foster automation in storage and networking operations through the utilization of infrastructure-as-code and orchestration guided by AI/ML
Qualification
Required
BS/MS in Computer Science, Electrical Engineering, or a related field, or equivalent experience
12+ years of experience in storage systems engineering, production infrastructure, or large-scale data center operations
Deep knowledge of networking protocols and technologies: TCP/IP, Ethernet, InfiniBand, RDMA, RoCE, NVMe-oF, Fibre Channel
Hands-on experience with high-performance storage systems: Lustre, GPFS, Ceph, distributed object storage, enterprise SAN/NAS
Expertise in Linux systems engineering, including tuning, performance analysis, and debugging
Skilled in coding/scripting using Python, Bash, Go, or C/C++ to automate, monitor, and optimize performance
Experience with configuration management/orchestration tools (Ansible, Terraform, Puppet, Chef, Kubernetes)
Familiarity with observability stacks (Prometheus, Grafana, Elastic, InfluxDB) to monitor and optimize storage and network performance
Proficient in recognizing and resolving complex system bottlenecks within storage and networking layers
Preferred
Experience crafting and operating RDMA-accelerated HPC/AI clusters at scale, with hands-on expertise with network topologies and large-scale switch/router deployments
Familiarity with network telemetry, packet capture tools (sFlow, NetFlow, Wireshark, and proven history of capacity planning and optimizing performance for distributed storage systems over high-speed networks
Background in jointly developing storage networks for AI/ML training pipelines, large-scale inference, and RAG workflows
Proficiency in hybrid cloud storage and networking solutions (like Kubernetes CSI, cloud-native fabrics, and hybrid on-prem/cloud setups)
Contributions to open-source networking or storage projects
Benefits
Equity
Benefits
Company
NVIDIA
NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.
H1B Sponsorship
NVIDIA has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1877)
2024 (1355)
2023 (976)
2022 (835)
2021 (601)
2020 (529)
Funding
Current Stage
Public CompanyTotal Funding
$4.09BKey Investors
ARPA-EARK Investment ManagementSoftBank Vision Fund
2023-05-09Grant· $5M
2022-08-09Post Ipo Equity· $65M
2021-02-18Post Ipo Equity
Recent News
Business Insider
2026-01-09
Business Insider
2026-01-09
Company data provided by crunchbase