Lambda · 1 month ago
Senior Site Reliability Engineer - Networking
Lambda, The Superintelligence Cloud, is a leader in AI cloud infrastructure serving tens of thousands of customers. The Senior Site Reliability Engineer - Networking will help scale Lambda’s high performance multi-tenant cloud network and contribute to the automation of network configuration and deployments while ensuring high availability and predictable networking performance.
AI InfrastructureArtificial Intelligence (AI)Cloud ComputingGPUMachine Learning
Responsibilities
Help scale Lambda’s high performance multi-tenant cloud network
Contribute to the reproducible automation of network configuration and deployments
Contribute to the implementation and operations of Software Defined Networks
Help to deploy and manage Spine and Leaf networks
Ensure high availability of our network through observability, failover, and redundancy
Ensure clients have predictable networking performance through the use of network engineering and other applicable technologies
Help with deploying and maintaining network monitoring and management tools
Participate in on-call
Qualification
Required
5+ years of experience being a Site Reliability Engineer or Network Reliability Engineering
Been part of the implementation of production-scale networking projects
Experience being on-call and incident response management
Have experience building and maintaining Software Defined Networks (SDN), experience with OpenStack, Neutron, OVN
Are comfortable on the Linux command line, and have an understanding of the Linux networking stack
Have experience with multi-data center networks and hybrid cloud networks
Have Python programming experience and configuration management tools like Ansible
Have experience with CI/CD tools for deployment and GIT. Operated network environment with GitOps practices in place
Experience with application lifecycle and deployments on Kubernetes
Preferred
Operated production-scale SDNs in a cloud context (e.g. helped implement or operate the infrastructure that powers an AWS VPC-like feature)
Have Software development experience with C, GO, Python
Experience automating network configuration within public clouds, with tools like Kubernetes, HELM, Terraform, and Ansible
Deep understanding of the Linux networking stack and its interaction with network virtualization, SR-IOV and DPDK
Understanding of the SDN ecosystem (e.g. OVS, Neutron, VMware NSX, Cisco ACI or Nexus Fabric Controller, Arista CVP)
Have experience with Spine and Leaf (Clos) network topology
Have experience and understanding of BGP EVPN VXLAN networks
Experience with building and maintaining multi-data center networks, SD-WAN, DWDM
Experience with Next-Generation Firewalls (NGFW)
Benefits
Health, dental, and vision coverage for you and your dependents
Wellness and commuter stipends for select roles
401k Plan with 2% company match (USA employees)
Flexible paid time off plan that we all actually use
Company
Lambda
Lambda is a cloud-based platform that provides high-performance GPU hardware and cloud infrastructure for AI model training and inference.
H1B Sponsorship
Lambda has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (16)
2024 (1)
2023 (3)
2022 (2)
2021 (2)
2020 (3)
Funding
Current Stage
Late StageTotal Funding
$3.19BKey Investors
TWG GlobalJP MorganMacquarie Group
2025-11-18Series E· $1.5B
2025-08-19Debt Financing· $275M
2025-02-19Series D· $480M
Recent News
2026-01-09
2026-01-08
2025-12-25
Company data provided by crunchbase