SRE Engineer jobs in United States

InterSources Inc · 5 months ago

SRE Engineer

InterSources Inc is currently seeking a highly skilled, hands-on SRE Lead Engineer to help lead transformational initiatives within IT operations. In this role, you will design and implement cutting-edge SRE solutions while driving IT operations organizations to adopt an engineering-centric approach.

Artificial Intelligence (AI) · Cyber Security · Information Technology · Software
Growth Opportunities
H1B Sponsor Likely

Responsibilities

Participate in the design and architecture of reliable, scalable, high-performance systems and services, with a focus on operational excellence, availability, and performance
Primary skill set: expertise in Observability as a Service and telemetry data collection using Dynatrace APM, SolarWinds, open-source tools (Prometheus and Grafana), log aggregation (Kibana or Splunk), and AIOps tools
Configure application performance monitoring (APM), infrastructure monitoring, synthetic monitoring, RUM, and log monitoring
Integrate Dynatrace with CI/CD pipelines, alerting tools, ITSM systems, and incident automation frameworks
Tune alert thresholds, baselines, and AI-driven anomaly detection to reduce noise and improve actionable insights
Maintain a deep understanding of login authentication mechanisms using Ping, ForgeRock, and SiteMinder technologies (session management and cookie management)
Build correlation mechanisms and dashboards that provide end-to-end visibility of requests from external to internal applications
Evangelize SRE evolution within IT operations and promote a culture of engineering excellence and best practices
Define best practices and principles for SRE, including incident management, monitoring, alerting, and automation
Collaborate with development teams on resiliency to ensure that services and applications are designed with operational reliability in mind
Implement monitoring systems to assess the performance of applications and infrastructure, and proactively identify areas for optimization
Understand the incident and problem management process, including post-mortems, and drive improvements to prevent future incidents
Analyze resource utilization patterns and forecast future capacity needs to ensure optimal performance and cost-efficiency
Ensure that SRE practices align with security and compliance requirements, and implement measures to protect systems and data
Drive operational excellence with a focus on automation, developing tools to streamline operational tasks and increase efficiency
Provide guidance and mentorship to SRE teams, fostering skill development and building a strong, capable SRE practice

Qualifications

Telemetry data collection, Application performance monitoring, Observability expertise, Incident management, Automation tools development, Collaboration with development teams, Guidance, Security, Compliance, Mentorship

Company

InterSources Inc

Innovative IT Solutions Built for Your Business. InterSources Inc is your partner in Growth and CyberSecurity.

H1B Sponsorship

InterSources Inc has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for your reference. (Data powered by the US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Trends of Total Sponsorships
2025 (4)
2024 (5)
2023 (16)
2022 (1)
2021 (7)
2020 (7)

Funding

Current Stage
Late Stage

Leadership Team

Ankit Shah
Chief Executive Officer
Jigar Kyada
Business Development Manager/Client Partner
Company data provided by Crunchbase