Senior Site Reliability Engineer - Networking jobs in United States

Lambda · 1 month ago

Senior Site Reliability Engineer - Networking

Lambda, The Superintelligence Cloud, is a leader in AI cloud infrastructure serving tens of thousands of customers. The Senior Site Reliability Engineer - Networking will help scale Lambda’s high-performance multi-tenant cloud network and contribute to the automation of network configuration and deployments while ensuring high availability and predictable networking performance.

AI Infrastructure, Artificial Intelligence (AI), Cloud Computing, GPU, Machine Learning
Comp. & Benefits
H1B Sponsor Likely

Responsibilities

Help scale Lambda’s high-performance multi-tenant cloud network
Contribute to the reproducible automation of network configuration and deployments
Contribute to the implementation and operations of Software Defined Networks
Help to deploy and manage Spine and Leaf networks
Ensure high availability of our network through observability, failover, and redundancy
Ensure clients have predictable networking performance through the use of network engineering and other applicable technologies
Help with deploying and maintaining network monitoring and management tools
Participate in on-call

Qualifications

Site Reliability Engineering, Software Defined Networks, Linux command line, Python programming, Network monitoring tools, CI/CD tools, Multi-data center networks, Incident response management, Configuration management tools, Kubernetes, Network virtualization, BGP EVPN VXLAN, Next-Generation Firewalls

Required

5+ years of experience as a Site Reliability Engineer or Network Reliability Engineer
Have been part of the implementation of production-scale networking projects
Experience with on-call rotations and incident response management
Have experience building and maintaining Software Defined Networks (SDN), with experience in OpenStack, Neutron, and OVN
Are comfortable on the Linux command line, and have an understanding of the Linux networking stack
Have experience with multi-data center networks and hybrid cloud networks
Have Python programming experience and experience with configuration management tools like Ansible
Have experience with Git and CI/CD tools for deployment, and have operated a network environment with GitOps practices in place
Experience with application lifecycle and deployments on Kubernetes

Preferred

Operated production-scale SDNs in a cloud context (e.g. helped implement or operate the infrastructure that powers an AWS VPC-like feature)
Have software development experience with C, Go, Python
Experience automating network configuration within public clouds, with tools like Kubernetes, Helm, Terraform, and Ansible
Deep understanding of the Linux networking stack and its interaction with network virtualization, SR-IOV and DPDK
Understanding of the SDN ecosystem (e.g. OVS, Neutron, VMware NSX, Cisco ACI or Nexus Fabric Controller, Arista CVP)
Have experience with Spine and Leaf (Clos) network topology
Have experience and understanding of BGP EVPN VXLAN networks
Experience with building and maintaining multi-data center networks, SD-WAN, DWDM
Experience with Next-Generation Firewalls (NGFW)

Benefits

Health, dental, and vision coverage for you and your dependents
Wellness and commuter stipends for select roles
401(k) plan with 2% company match (USA employees)
Flexible paid time off plan that we all actually use

Company

Lambda

Lambda is a cloud-based platform that provides high-performance GPU hardware and cloud infrastructure for AI model training and inference.

H1B Sponsorship

Lambda has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is presented below for your reference. (Data powered by the US Department of Labor)
[Chart: Distribution of Different Job Fields Receiving Sponsorship]
Trends of Total Sponsorships
2025 (16)
2024 (1)
2023 (3)
2022 (2)
2021 (2)
2020 (3)

Funding

Current Stage
Late Stage
Total Funding
$3.19B
Key Investors
TWG Global, JP Morgan, Macquarie Group
2025-11-18 · Series E · $1.5B
2025-08-19 · Debt Financing · $275M
2025-02-19 · Series D · $480M

Leadership Team

Stephen Balaban
Co-founder, CEO
Michael Balaban
Co-founder, CTO
Company data provided by Crunchbase