Senior Site Reliability Engineer @ Tekshapers | Jobright.ai
JOBSarrow
RecommendedLiked
0
Applied
0
Senior Site Reliability Engineer jobs in United States
49 applicants
expire-info-iconThis job has closed.
company-logo

Tekshapers ยท 2 days ago

Senior Site Reliability Engineer

Wonder how qualified you are to the job?

ftfMaximize your interview chances
Software
Hiring Manager
Akanksha Somal
linkedin

Insider Connection @Tekshapers

Discover valuable connections within the company who might provide insights and potential referrals, giving your job application an inside edge.

Responsibilities

Participate in design and architecture of reliable, scalable, and high-performance systems and services with a focus on operational excellence, availability, and performance.
Evangelize SRE evolution within IT operations and promote a culture of engineering excellence and best practices.
Define best practices and principles for SRE, including incident management, monitoring, alerting, and automation.
Collaborate with development teams on resiliency to ensure services and applications are designed with operational reliability in mind.
Implement monitoring systems to assess performance of applications and infrastructure, and proactively identify areas for optimization.
Understand incident and problem management process, post-mortems, and drive improvements to prevent future incidents.
Analyze resource utilization patterns and forecast future capacity needs to ensure optimal performance and cost-efficiency.
Ensure SRE practices align with security and compliance requirements and implement measures to protect systems and data.
Focus on automation and develop tools to streamline operational tasks and increase efficiency.
Provide guidance and mentorship to SRE teams, fostering skill development and building a strong and capable SRE practice.
Develop close relationships with other operational teams to integrate SRE practices and drive overall operational improvements across the enterprise.
Stay up to date on industry trends, new technologies, and best practices in SRE and apply relevant advancements to the organization.

Qualification

Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.

Cloud TechnologiesSRE ToolsetsAutomationAWSDockerKubernetesLinux CommandsGitLab CICDTerraformMonitoringAlertingSplunkPrometheusGrafanaKibanaELKAPM ToolsDatadogAppDynamicsDynatraceObservability FrameworkSLI/SLOAnsibleScripting LanguagesGroovy-DSLJavaPythonYamlMicroservices ArchitectureMQ

Required

Around 10-12 years of SRE hands-on experience with cloud technologies, development, SRE toolsets, and automation
Strong hands-on experience with any Cloud Technology (AWS): Control Tower, Project Setup, Creating Accounts, RDS, SSO
Solid understanding and hands-on experience with Docker/Kubernetes
Should have good experience with Linux Commands, GitLab CICD Setup, and Terraform (state management, etc)
Monitoring & alerting setup experience with Splunk, Prometheus, Grafana, Kibana, ELK etc.
Hands-on APM Tool/s experience, preferably Datadog or AppDynamics or Dynatrace
Good understanding of Observability Framework leveraging programmatic SLI/SLO blueprints to standardize the collection of golden signals
Should have automation (data refresh, releases, DB snapshots) experience using Ansible or any other scripting languages
Experience with following languages (Groovy-DSL, Java, Python, Yaml, and microservices architecture)
Good understanding and hands-on experience with MQ, Kafka
Experience with Databases (Oracle, MySQL)

Preferred

Any of the relevant professional certifications โ€“ Certified Site Reliability Engineer (CSRE), Certified Kubernetes Administrator (CKA), AWS Certified DevOps Engineer Professional, Google Cloud Professional; DevOps Engineer

Company

Tekshapers

twittertwittertwitter
company-logo
Tekshapers was founded in 2009, an MI, USA based Information Technology Company and our primary objective is to provide sophisticated business solutions to a group of companies worldwide.

Funding

Current Stage
Late Stage
Company data provided by crunchbase
logo

Orion

Your AI Copilot