Lambda · 1 month ago
Senior Software Engineer - Managed Kubernetes
Lambda, The Superintelligence Cloud, is a leader in AI cloud infrastructure serving a diverse range of customers. They are seeking a Senior Software Engineer to join their Managed Kubernetes team, where the role involves designing and maintaining scalable Kubernetes-based infrastructure and developing automation tools for cluster lifecycle management.
AI Infrastructure · Artificial Intelligence (AI) · Cloud Computing · Data Center · GPU · Machine Learning
Responsibilities
Design, build, and maintain scalable control plane services, operators, and custom Kubernetes controllers, while developing automation in Python/Go for end-to-end cluster lifecycle management — including provisioning, upgrades, patching, and deletion
Identify gaps and develop internal tools, APIs, and command-line interfaces (CLIs) that enable customers and ML/AI teams to deploy and effectively monitor inference services
Write resilient systems that gracefully handle failure across large-scale distributed environments
Develop automated tests to ensure quality and stability, and validate the clusters to identify and address hardware issues before delivery
Support and debug production issues through on-call rotation
Qualifications
Required
6+ years of experience in software engineering
3+ years leading large-scale, complex projects or serving as a tech lead
At least two years of experience working on orchestration and deployment systems
Experience using Kubernetes and third-party operators (CRDs, CSI, CNI, etc.)
Strong programming skills in Go and Python; ability to collaborate effectively on shared codebases
Take pride in owning and delivering core components of products and platforms
Experience with infrastructure-as-code tools (e.g. Terraform, Pulumi)
Solid knowledge of Linux systems, networking, containers, and cloud infrastructure
Preferred
Deep Kubernetes and Linux expertise
Experience operating the control plane and low-level pieces of large-scale Kubernetes clusters
Experience with user-level restrictions and hardening (e.g. AppArmor)
Experience with HPC clusters, environments & tooling
Experience with machine learning/AI frameworks
Expertise with hybrid or multi-cloud Kubernetes environments
Familiarity with GPU, InfiniBand, or high-performance computing on K8s
Past contributions to CNCF projects or Kubernetes SIGs a plus
Benefits
Generous cash & equity compensation
Health, dental, and vision coverage for you and your dependents
Wellness and commuter stipends for select roles
401(k) plan with 2% company match (USA employees)
Flexible paid time off plan that we all actually use
Company
Lambda
Lambda is a cloud-based platform that provides high-performance GPU hardware and cloud infrastructure for AI model training and inference.
H1B Sponsorship
Lambda has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2025 (16)
2024 (1)
2023 (3)
2022 (2)
2021 (2)
2020 (3)
Funding
Current Stage: Late Stage
Total Funding: $3.19B
Key Investors: TWG Global, JP Morgan, Macquarie Group
2025-11-18 · Series E · $1.5B
2025-08-19 · Debt Financing · $275M
2025-02-19 · Series D · $480M
Company data provided by Crunchbase