Senior Software Engineer - Managed Kubernetes jobs in United States

Lambda · 1 month ago

Senior Software Engineer - Managed Kubernetes

Lambda, The Superintelligence Cloud, is a leader in AI cloud infrastructure serving a diverse range of customers. They are seeking a Senior Software Engineer to join their Managed Kubernetes team to design and maintain scalable Kubernetes-based infrastructure and develop automation tools for cluster lifecycle management.

AI Infrastructure, Artificial Intelligence (AI), Cloud Computing, Data Center, GPU, Machine Learning
Comp. & Benefits
H1B Sponsor Likely

Responsibilities

Design, build, and maintain scalable control plane services, operators, and custom Kubernetes controllers, while developing automation in Python/Go for end-to-end cluster lifecycle management — including provisioning, upgrades, patching, and deletion
Identify gaps and develop internal tools, APIs, and command-line interfaces (CLIs) that enable customers and ML/AI teams to deploy and effectively monitor inference services
Write resilient systems that gracefully handle failure across large-scale distributed environments
Develop automated tests to ensure quality and stability, and validate the clusters to identify and address hardware issues before delivery
Support and debug production issues through on-call rotation

Qualifications

Kubernetes, Go, Python, Infrastructure-as-code, Linux systems, Networking, Containers, Cloud infrastructure, Automation, Distributed systems

Required

6+ years of experience in software engineering
3+ years leading large-scale, complex projects or serving as a tech lead
At least two years of experience working on orchestration and deployment systems
Experience using Kubernetes and third-party operators (CRDs, CSI, CNI, etc.)
Strong programming skills in Go and Python; ability to collaborate effectively on shared codebases
Take pride in owning and delivering core components of products and platforms
Experience with infrastructure-as-code tools (e.g. Terraform, Pulumi)
Solid knowledge of Linux systems, networking, containers, and cloud infrastructure

Preferred

Deep Kubernetes and Linux expertise
Experience operating the control plane and low-level pieces of large-scale Kubernetes clusters
Experience with user-level restrictions and hardening (e.g. AppArmor)
Experience with HPC clusters, environments & tooling
Experience with machine learning/AI frameworks
Expertise with hybrid or multi-cloud Kubernetes environments
Familiarity with GPUs, InfiniBand, or high-performance computing on K8s
Past contributions to CNCF projects or Kubernetes SIGs a plus

Benefits

Generous cash & equity compensation
Health, dental, and vision coverage for you and your dependents
Wellness and commuter stipends for select roles
401k Plan with 2% company match (USA employees)
Flexible paid time off plan that we all actually use

Company

Lambda

Lambda is a cloud-based platform that provides high-performance GPU hardware and cloud infrastructure for AI model training and inference.

H1B Sponsorship

Lambda has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data from the US Department of Labor)
Trends of Total Sponsorships
2025 (16)
2024 (1)
2023 (3)
2022 (2)
2021 (2)
2020 (3)

Funding

Current Stage: Late Stage
Total Funding: $3.19B
Key Investors: TWG Global, JP Morgan, Macquarie Group
2025-11-18 · Series E · $1.5B
2025-08-19 · Debt Financing · $275M
2025-02-19 · Series D · $480M

Leadership Team

Stephen Balaban
Co-founder, CEO
Michael Balaban
Co-Founder / CTO
Company data provided by Crunchbase