Crisis Text Line · 2 days ago
Senior Infrastructure Site Reliability Engineer
Information Technology · Messaging
Responsibilities
Lead and maintain highly available, scalable, and secure infrastructure on AWS Fargate.
Design and maintain CloudWatch alerting and monitoring configurations to proactively identify and resolve potential issues.
Mentor and guide junior team members, sharing best practices and promoting a culture of excellence.
Collaborate with cross-functional teams to define and implement best practices for infrastructure as code (IaC), continuous integration/continuous deployment (CI/CD), and site reliability engineering (SRE) methodologies.
Lead incident response and resolution, including troubleshooting complex system issues and implementing preventive measures to minimize downtime.
Automate repetitive tasks and processes to improve operational efficiency and reduce manual intervention.
Conduct performance tuning and optimization of infrastructure components to ensure optimal resource utilization and cost efficiency.
Stay up to date with emerging technologies and industry trends to drive innovation and continuous improvement.
Qualifications
Required
Bachelor's degree in Computer Science, Engineering, or related field (Master's degree preferred) or equivalent experience.
Experience in site reliability engineering (SRE) or related roles, with a focus on cloud infrastructure management.
Hands-on experience with AWS services, particularly AWS Fargate, CloudWatch, and related tools.
Proficiency with infrastructure as code (IaC) tools such as Terraform or CloudFormation.
Strong scripting and automation skills using languages such as Python, Bash, or PowerShell.
Experience with container orchestration platforms such as Kubernetes or Amazon ECS.
Solid understanding of networking concepts, security best practices, and DevOps principles.
Strong problem-solving skills and the ability to work effectively in a fast-paced, collaborative environment.
Preferred
AWS certifications (e.g., AWS Certified Solutions Architect, AWS Certified DevOps Engineer) are a plus.
Benefits
20 paid holidays, including federal holidays such as Juneteenth and Labor Day, Election Day, a holiday break from December 24 through January 1, 2 renewal days, and 2 floating holidays
Flexible paid time off, including 15 vacation days, 3 personal days, 7 sick days
Medical, dental, and vision benefits for the staff member and family at no cost to the employee
403(b) retirement plan: 3% contribution by Crisis Text Line
12 weeks paid parental leave (after 6 months of employment)
Student loan repayment (after 2 years of continuous full-time service)
Family support through a virtual childcare platform
Stipends/Allowances
Mental health (Monthly)
Internet Service (Monthly)
Professional Development (Annual)
Wellness (Annual)
Home office setup (One time/First year)
Company
Crisis Text Line
Crisis Text Line is free, 24/7 emotional support for those in crisis.
Funding
Current Stage: Growth Stage
Total Funding: $30.8M
2016-06-15 · Series B · $23.8M
2015-10-08 · Series Unknown · $7M
Company data provided by Crunchbase