Karsun Solutions · 1 week ago
Site Reliability Engineer Manager
Wonder how qualified you are to the job?
ConsultingGovernment
H1B SponsorshipComp. & Benefits
Insider Connection @Karsun Solutions
Responsibilities
Lead a service delivery team of 8-20 people (Service Support specialist, DevSecOps and Site reliability engineers)
Define and implement best practices for infrastructure as code, deployment automation, and monitoring.
Collaborate with cross-functional teams to design scalable and fault-tolerant architectures.
Develop and maintain service level objectives (SLOs) and key performance indicators (KPIs) to measure system reliability and performance.
Conduct post-mortems and root cause analyses for incidents and implement preventive measures to mitigate future incidents.
Drive continuous improvement initiatives to enhance the reliability, scalability, and efficiency of systems and services.
Mentor and coach team members to foster a culture of learning and innovation.
Qualification
Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.
Required
Bachelor’s degree in computer science, Engineering, or a related field; Master's degree preferred.
10+ years of experience in a similar role managing a team of site reliability engineers and delivering in AWS cloud platform.
Proven track record of managing high-performance teams.
5+ years of experience supporting operations and maintenance for cloud-native applications in production that are fault-tolerant, self-healing, scalable and high available.
Deep understanding of cloud computing platforms (e.g., AWS, Azure, GCP) and containerization technologies (e.g., Docker, Kubernetes).
Strong knowledge of infrastructure as code tools (e.g., Terraform, Ansible, ArgoCD) and CI/CD pipelines.
Experience with monitoring, logging, and observability tools like DataDog, AWS Cloudwatch, ELK, Prometheus, Splunk etc.
Excellent communication and interpersonal skills, with the ability to collaborate effectively with cross-functional teams.
Strong problem-solving and analytical skills, with a keen attention to detail.
Certifications such as AWS Certified DevOps Engineer or Google Professional Cloud DevOps Engineer are a plus.
Ability to obtain and maintain a Public Trust clearance.
Preferred
Understanding of modern architecture, e.g. micro-services, EDA, etc., and cautious against overcomplexity and overengineering.
Experience with monitoring and metrics platforms, e.g. New Relic, Prometheus, InfluxDB, Grafana, Splunk, etc.
Experience designing and operating distributed systems and cloud infrastructure at scale.
Benefits
Health, Life & Disability Insurance – Medical, Dental, Life and Disability coverage is paid for by Karsun for full-time employees.
Paid Parental Leave
401k Retirement Plan – with pre-tax and post-tax ROTH contribution offerings and immediate vesting with a per pay period match
Generous time off programs including 11 paid holidays per year
Supplemental plans such as Vision, Pet Insurance and 529 Savings Plan
Employee Assistance Program with behavioral health, physical wellness and financial advice
Employee Discounts & Perks
In-house Technical/Skills Training
Company
Karsun Solutions
Karsun Solutions LLC specializes in Enterprise Modernization and Transformation solutions for the Federal Government.
H1B Sponsorship
Karsun Solutions has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Trends of Total Sponsorships
2023 (16)
2022 (18)
2021 (28)
2020 (25)
Funding
Current Stage
Growth StageCompany data provided by crunchbase