Senior Infrastructure Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Adaption · 4 hours ago

Senior Infrastructure Engineer

Adaption is focused on building AI systems that are flexible, personalized, and accessible. The role involves setting up core infrastructure systems, including CI/CD pipelines and cloud infrastructure, while ensuring scalability and security.

Computer Software

Responsibilities

Design, build, and own the cloud infrastructure from scratch (0→1), with a focus on scalability, reliability, and security
Define and implement Infrastructure as Code (IaC) using Terraform (or similar tools)
Set up and manage Kubernetes clusters for service orchestration and deployment
Design, deploy, and manage GPU clusters for AI/ML training and inference workloads
Build CI/CD pipelines to automate deployment and testing
Implement observability best practices (monitoring, logging, alerting)
Collaborate closely with backend and product engineers to define deployment and scaling strategies
Ensure infrastructure cost-efficiency and help set up cost monitoring tools
Define internal best practices for DevOps and platform engineering

Qualification

Cloud infrastructure designInfrastructure as CodeKubernetes managementCI/CD pipeline developmentGPU cluster managementMonitoringLoggingCloud security best practicesStartup experienceService mesh knowledgeOpen-source contributions

Required

5+ years of experience as a Infrastructure / SRE / Platform / DevOps engineer
Experience designing and building cloud-native infrastructure from scratch (preferably in AWS, GCP, or Azure)
Proficient with Kubernetes (EKS, GKE, or self-managed clusters)
Experience setting up and managing GPU resources and clusters for machine learning workloads
Strong hands-on experience with Terraform and Infrastructure as Code best practices
Experience building CI/CD pipelines using tools like GitHub Actions, ArgoCD, CircleCI, or similar
Familiar with monitoring and logging stacks (e.g., Prometheus, Grafana, Datadog, ELK, etc.)
Security-minded and familiar with cloud and container security best practices
Ability to balance ideal solutions with startup pragmatism and speed

Preferred

Experience in a fast-paced startup environment
Knowledge of service mesh technologies (e.g., Istio, Linkerd)
Experience with serverless infrastructure or hybrid architectures
Contributions to open-source infrastructure tools

Benefits

Flexible work: In-person collaboration in the Bay Area, a distributed global-first team, and quarterly offsites.
Adaption Passport: Annual travel stipend to explore a country you've never visited. We're building intelligence that evolves alongside you, so we encourage you to keep expanding your horizons.
Lunch Stipend: Weekly meal allowance for take-out or grocery delivery.
Well-Being: Comprehensive medical benefits and generous paid time off.

Company

Adaption

twitter
company-logo

Funding

Current Stage
Early Stage
Company data provided by crunchbase