Senior DevOps/SRE Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

VITG ยท 2 weeks ago

Senior DevOps/SRE Engineer

VITG is seeking a skilled Senior DevOps Site Reliability Engineer (SRE) to ensure the reliability, availability, and performance of enterprise services hosted across Cloud Service Providers and on-prem data centers. The role involves implementing SRE principles, developing monitoring solutions, and collaborating with cross-functional teams to enhance DevOps practices.

Information Technology & Services
badNo H1BnoteU.S. Citizen Onlynote

Responsibilities

Design and develop monitoring solutions leveraging approved AWS services using Infrastructure as Code (IaC) tools
Develop and maintain CI/CD pipelines using Github, Jenkins
Develop serverless functions and scripts using python, curl, and/or bash
Leverage observability best practices to proactively identify potential software issues and implement preventive measures to minimize potential for system incidents and outages
Set and monitor critical metrics to gain insights into system reliability, including latency, traffic, errors, and saturation levels
Learn and adapt new technologies to perform POCs (Proof of Concepts) based on project needs
Provide guidance, training, and support for external development teams to manage their infrastructure independently
Develop, publish, and maintain all required documentation in the repository and ticketing system (i.e., Confluence and Jira)
Respond quickly and effectively to critical incidents, conduct post-incident reviews to identify root causes and implement preventive measures
Collaborate effectively with cross-functional teams and communicate SRE concepts and recommendations clearly to both technical and non-technical stakeholders
Participate in reliability-based release management processes
Plan, participate and manage on-call rotations to ensure prompt response to reported performance and reliability issues
Attend ongoing and ad hoc meetings with internal and external stakeholders
Stay up-to-date with the latest industry trends, technologies, and best practices related to SRE, DevOps, and infrastructure management

Qualification

SRE principlesAWS servicesMonitoring toolsRoot cause analysisShell/Bash scriptingPython scriptingCI/CD pipelinesInfrastructure as CodeProblem solvingEffective communicationTeam playerContinuous learning

Required

Must be a US citizen or must be authorized to work in the United States
Must have lived in the USA for three (3) of the last five (5) years
Must be able to obtain a US federal government badge and eligible for Public Trust clearance
Must be able to pass a VITG background check, including a drug test
Demonstrate hand-on expertise in SRE principles, with a strong understanding of maintaining quality and stability of enterprise services in a continuous development environment
Must possess experience designing and developing solutions using various AWS services
Must possess experience in developing scripts in Shell/Bash, Python and deploying them as step/lambda functions
Must possess experience working with monitoring and administering observability tools like Splunk, Datadog, and New Relic
Possess extensive knowledge in troubleshooting issues while leveraging monitoring tools like Splunk, Datadog, New Relic, AWS services, etc
Possess skill related to analyzing, identifying and documenting root cause analysis
Possess a strong technical background and be able to provide clear explanations of technical concepts verbally and in writing
Demonstrate ability and passion to learn new technologies quickly and perform Proof of Concepts (POCs) based on project needs
Apply strong problem solving skills in monitoring system performance, troubleshooting issues, crisis management, etc
Produce high quality work independently and collaboratively
Excel in a fast-paced environment
Demonstrate effective communication and collaboration, and be a team player
AWS Certified SysOps/DevOps Associate or equivalent AWS certification (Required)

Preferred

Splunk Core Certified Certification (Strongly Preferred)
Datadog Certification (Strongly Preferred)

Benefits

401(k) with employer contribution
Medical/Dental/Vision insurance (option for full coverage for employee)
Life, ST/LT insurance
Professional development opportunities
Company-paid holidays and paid vacation (PTO)

Company

VITG

twitter
company-logo
VITG is a SBA 8(a) Certified & HUBZone Certified Small Minority & Disadvantaged Business.

Funding

Current Stage
Early Stage

Leadership Team

leader-logo
Vasu Togari
Chief Executive Officer
linkedin
Company data provided by crunchbase