Senior Infrastructure Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Replit · 1 week ago

Senior Infrastructure Engineer

Replit is the agentic software creation platform that enables anyone to build applications using natural language. As a Senior Infrastructure Engineer, you will help ensure the reliability, scalability, and performance of Replit's infrastructure, implementing automation and best practices to support millions of developers worldwide.

Artificial Intelligence (AI)Cloud ComputingDeveloper ToolsInformation TechnologySoftware
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Drive Automation and Infrastructure as Code: Build and improve automation to eliminate toil and operational work. Maintain CI/CD pipelines and infrastructure automation using tools like Terraform or Pulumi. Create self-healing systems that can automatically respond to common failure scenarios
Optimize Performance and Infrastructure: Collaborate with core infrastructure and product teams to performance tune and optimize our cloud deployments (Kubernetes, Docker, GCP). Identify and resolve performance bottlenecks and implement capacity planning strategies
Elevate Developer Experience: Design and implement improvements to our build, test, and deployment systems to make software delivery faster, safer, and more reliable for all engineers
Drive Cross-Team Improvements: Partner with service owners across Replit to understand their pain points, and collaborate on implementing build/test/deploy enhancements within their specific services
Build Shared Tooling: Create and maintain centralized tooling and automation that improves the engineering lifecycle, from local development to production monitoring
Debug and Harden Systems: Dive deep into debugging difficult technical problems, making our systems and products more robust, operable, and easier to diagnose
Collaborate on Design Reviews: Participate in feature and system design reviews, contributing expertise on security, scale, and operational considerations
Build and Integrate: Write high-quality, well-tested code to meet the needs of your customers, including building pipelines to integrate with 3rd party vendors

Qualification

Site Reliability EngineeringInfrastructure as CodeContainer OrchestrationMonitoring SolutionsProgramming in PythonCloud-Native TechnologiesIncident ManagementDebugging SkillsPassion for Software CreationCommunicationCritical Thinking

Required

4+ years of experience in Site Reliability Engineering or similar roles (DevOps, Systems Engineering, Infrastructure Engineering)
Strong programming skills in languages like Python or Go
You write high-quality, well-tested code
Solid understanding of distributed systems. You've built, scaled, and maintained production services and understand service-oriented architecture
Experience with container orchestration platforms (Kubernetes) and cloud-native technologies
Experience implementing and maintaining monitoring/observability solutions, with strong skills in debugging and performance tuning
Strong incident management skills with experience participating in incident response and demonstrated critical thinking under pressure
Experience with infrastructure as code (e.g., Terraform) and configuration management tools
Excellent written and verbal communication skills, with an ability to explain technical concepts clearly
A willingness to dive into understanding, debugging, and improving any layer of the stack
You're passionate about making software creation accessible and empowering the next generation of builders

Preferred

Experience with Google Cloud Platform (GCP) services and tools
Knowledge of modern observability platforms (Prometheus, Grafana, Datadog, etc.)
Experience building reliable systems capable of handling high throughput and low latency
Experience with Go and Terraform
Familiarity with working in rapid-growth environments

Benefits

Competitive Salary & Equity
401(k) Program
Health, Dental, Vision and Life Insurance
Short Term and Long Term Disability
Paid Parental, Medical, Caregiver Leave
Commuter Benefits
Monthly Wellness Stipend
Autonoumous Work Environement
In Office Set-Up Reimbursement
Flexible Time Off (FTO) + Holidays
Quarterly Team Gatherings
In Office Amenities

Company

Replit

twittertwittertwitter
company-logo
Replit is the most secure agentic platform for production-ready apps.

H1B Sponsorship

Replit has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (8)
2024 (5)
2023 (2)
2022 (2)

Funding

Current Stage
Growth Stage
Total Funding
$472.02M
Key Investors
Prysm CapitalCraft VenturesAndreessen Horowitz
2025-07-30Series C· $250M
2023-11-06Series B· $20M
2023-04-25Series B· $97.4M

Leadership Team

leader-logo
Amjad Masad
CEO
linkedin
leader-logo
Haya Odeh
Co-Founder
linkedin
Company data provided by crunchbase