Replit · 1 week ago
Senior Infrastructure Engineer
Replit is the agentic software creation platform that enables anyone to build applications using natural language. As a Senior Infrastructure Engineer, you will help ensure the reliability, scalability, and performance of Replit's infrastructure, implementing automation and best practices to support millions of developers worldwide.
Artificial Intelligence (AI) · Cloud Computing · Developer Tools · Information Technology · Software
Responsibilities
Drive Automation and Infrastructure as Code: Build and improve automation to eliminate toil and operational work. Maintain CI/CD pipelines and infrastructure automation using tools like Terraform or Pulumi. Create self-healing systems that can automatically respond to common failure scenarios
Optimize Performance and Infrastructure: Collaborate with core infrastructure and product teams to tune and optimize the performance of our cloud deployments (Kubernetes, Docker, GCP). Identify and resolve performance bottlenecks and implement capacity planning strategies
Elevate Developer Experience: Design and implement improvements to our build, test, and deployment systems to make software delivery faster, safer, and more reliable for all engineers
Drive Cross-Team Improvements: Partner with service owners across Replit to understand their pain points, and collaborate on implementing build/test/deploy enhancements within their specific services
Build Shared Tooling: Create and maintain centralized tooling and automation that improves the engineering lifecycle, from local development to production monitoring
Debug and Harden Systems: Dive deep into debugging difficult technical problems, making our systems and products more robust, operable, and easier to diagnose
Collaborate on Design Reviews: Participate in feature and system design reviews, contributing expertise on security, scale, and operational considerations
Build and Integrate: Write high-quality, well-tested code to meet the needs of your customers, including building pipelines to integrate with third-party vendors
Qualifications
Required
4+ years of experience in Site Reliability Engineering or similar roles (DevOps, Systems Engineering, Infrastructure Engineering)
Strong programming skills in languages like Python or Go
You write high-quality, well-tested code
Solid understanding of distributed systems. You've built, scaled, and maintained production services and understand service-oriented architecture
Experience with container orchestration platforms (Kubernetes) and cloud-native technologies
Experience implementing and maintaining monitoring/observability solutions, with strong skills in debugging and performance tuning
Strong incident management skills with experience participating in incident response and demonstrated critical thinking under pressure
Experience with infrastructure as code (e.g., Terraform) and configuration management tools
Excellent written and verbal communication skills, with an ability to explain technical concepts clearly
A willingness to dive into understanding, debugging, and improving any layer of the stack
You're passionate about making software creation accessible and empowering the next generation of builders
Preferred
Experience with Google Cloud Platform (GCP) services and tools
Knowledge of modern observability platforms (Prometheus, Grafana, Datadog, etc.)
Experience building reliable systems capable of handling high throughput and low latency
Experience with Go and Terraform
Familiarity with working in rapid-growth environments
Benefits
Competitive Salary & Equity
401(k) Program
Health, Dental, Vision and Life Insurance
Short-Term and Long-Term Disability
Paid Parental, Medical, and Caregiver Leave
Commuter Benefits
Monthly Wellness Stipend
Autonomous Work Environment
In-Office Setup Reimbursement
Flexible Time Off (FTO) + Holidays
Quarterly Team Gatherings
In-Office Amenities
Company
Replit
Replit is the most secure agentic platform for production-ready apps.
H1B Sponsorship
Replit has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for your reference. (Data powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship (chart not shown)
Trends of Total Sponsorships
2025 (8)
2024 (5)
2023 (2)
2022 (2)
Funding
Current Stage: Growth Stage
Total Funding: $472.02M
Key Investors: Prysm Capital, Craft Ventures, Andreessen Horowitz
2025-07-30 · Series C · $250M
2023-11-06 · Series B · $20M
2023-04-25 · Series B · $97.4M
Recent News
2026-01-19
Company data provided by Crunchbase