Tata Consultancy Services · 14 hours ago
Site Reliability Engineer
Tata Consultancy Services is seeking a Site Reliability Engineer to ensure the health and performance of production systems. The role involves troubleshooting incidents, collaborating with engineering teams, and implementing reliability best practices.
Enterprise SoftwareCloud ComputingConsultingInformation TechnologyBusiness Information SystemsIT Management
Responsibilities
Excellent problem-solving skills and the ability to work under pressure in a fast-paced environment
Monitor and maintain the health, availability, and performance of production systems and applications
Troubleshoot and resolve production incidents, ensuring minimal downtime and service disruption
Identifying Defects and working with Dev to get them fixed based on priority
Taking care of implementation of RFCs
Doing pre and post validation of servers during traffic diversion
Collaborate with engineering teams to implement reliability best practices and improve system performance
Develop and maintain monitoring alerts and dashboards to ensure visibility into system metrics
Participate in on-call rotation and provide timely support for high-impact incidents
Implement automation tools and processes to streamline operations and reduce manual workloads
Document incidents and solutions for knowledge management and continuous improvement
Qualification
Required
Core Java
Splunk
Kibana
Grafana
Databases: Postgres
Databases: MongoDB
Experience in Production support engineering or SRE roles, preferably within the banking industry
Skilled in L1/L2 support, debugging, performance monitoring, and working in Agile/Scrum environments
Hands-on with ServiceNow
Hands-on with Spring Boot
Hands-on with REST APIs
Hands-on with CI/CD pipelines
Strong knowledge of cloud services
Excellent problem-solving skills and the ability to work under pressure in a fast-paced environment
Monitor and maintain the health, availability, and performance of production systems and applications
Troubleshoot and resolve production incidents, ensuring minimal downtime and service disruption
Identifying Defects and working with Dev to get them fixed based on priority
Taking care of implementation of RFCs
Doing pre and post validation of servers during traffic diversion
Collaborate with engineering teams to implement reliability best practices and improve system performance
Develop and maintain monitoring alerts and dashboards to ensure visibility into system metrics
Participate in on-call rotation and provide timely support for high-impact incidents
Implement automation tools and processes to streamline operations and reduce manual workloads
Document incidents and solutions for knowledge management and continuous improvement
BACHELOR OF COMPUTER SCIENCE
Company
Tata Consultancy Services
Tata Consultancy Services is a business solutions company that specializes on information technology services and consulting.
H1B Sponsorship
Tata Consultancy Services has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (7880)
2024 (9690)
2023 (8537)
2022 (11159)
2021 (9813)
2020 (11984)
Funding
Current Stage
Public CompanyTotal Funding
unknown2004-08-25IPO
Leadership Team
Recent News
2026-02-12
2026-02-12
Company data provided by crunchbase