Principal Software Engineer - Networking jobs in United States
info-icon
This job has closed.
company-logo

San Francisco Compute Company · 2 months ago

Principal Software Engineer - Networking

San Francisco Compute Company is building a liquid market for GPU offtake to mitigate risks in financing GPU clusters. As a Principal Software Engineer - Networking, you will design and operate the infrastructure for GPU clusters, focusing on system software, orchestration, and distributed automation.

Information TechnologyInternet
check
H1B Sponsorednote

Responsibilities

Design and operate orchestration frameworks to manage tens of thousands of GPUs across Kubernetes, virtualization, and bare metal
Develop automation frameworks for large-scale provisioning, monitoring, and fault tolerance
Build distributed systems that can withstand node or cluster-wide failures
Architect software-defined networking solutions that integrate with underlay switches and support scalable designs
Collaborate with networking specialists to ensure fabric resilience, low latency, and scalability, leveraging routing protocols like BGP where needed
Integrate high-performance distributed storage with compute and networking layers

Qualification

Distributed systemsSoftware-defined networkingAutomation frameworksLinux internalsGPU/HPC clustersNetworking protocolsScripting skillsDocumentation skillsCollaboration skillsProblem-solving skills

Required

Strong software engineering background, with experience building fault-tolerant distributed systems
Comfortable with Linux internals, debugging, and performance optimization
Exposure to GPU/HPC clusters
Networking literacy: familiar with eBGP, VXLAN, RoCEv2, and InfiniBand, plus an understanding of how to design software systems that dynamically leverage these fabrics
Strong automation, scripting, and documentation skills

Preferred

Go or Rust experience (3+ years)
Deep knowledge of HPC fabrics (InfiniBand, Ultra Ethernet, RoCEv2)
Experience with high-performance storage (WEKA, VAST, Ceph, etc.)
Prior exposure to global distributed compute operations

Benefits

Generous equity grant
Visa Sponsorships
Retirement matching
Medical, dental & vision
Time off
Parental leave
Daily lunch
Unlimited office book budget

Company

San Francisco Compute Company

twittertwittertwitter
company-logo
Compute is a commodity. We think people should buy it like one.

Funding

Current Stage
Early Stage
Total Funding
$52M
Key Investors
Altman Capital
2025-11-26Series A· $40M
2024-07-16Series Unknown· $12M
Company data provided by crunchbase