Senior Observability Platform Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

SS&C Technologies · 4 days ago

Senior Observability Platform Engineer

SS&C Technologies is a leading financial services and healthcare technology company headquartered in Windsor, Connecticut. The role offers an exciting opportunity for software engineers passionate about open source software, Linux, Kubernetes, and Observability, focusing on designing and maintaining a comprehensive observability stack.

Cloud ManagementEnterprise SoftwareFinanceFinancial ServicesHealth CareProfessional ServicesSoftwareSoftware Engineering
check
H1B Sponsor Likelynote

Responsibilities

Responsible for designing, developing, implementing, and maintaining our comprehensive observability stack, including tracing, telemetry, logging, health monitoring, visualization, and dashboards. You will play a key role in ensuring the reliability, performance, and operational efficiency of our services
Design and implement a robust observability framework using composable open source solutions like Prometheus, Alertmanager, OpenTelemetry, Grafana, Alloy, Loki, Promtail, Tempo, Thanos, ELK stack, Zabbix, and similar
Develop and maintain health monitoring and alerting systems for our compute platforms, databases, network infrastructure as well as Kubernetes-based platforms including GPU-supported environments
Create and manage visualization dashboards to monitor system performance, resource utilization, and operational health
Implement scalable, distributed logging and tracing solutions to diagnose, troubleshoot, and resolve system issues effectively
Collaborate with development and operations teams to integrate observability practices into the development lifecycle
Conduct performance analysis and optimization to ensure system reliability and efficiency
Stay updated with the latest trends and technologies in observability and performance monitoring
Collaborate with cross-functional teams (Cloud Engineering, Network, and DevOps/Solutions Engineering) to troubleshoot and resolve infrastructure issues

Qualification

Observability toolsKubernetesScripting languagesInfrastructure-as-codePerformance analysisProblem-solvingCommunication skills

Required

Designing, developing, implementing, and maintaining a comprehensive observability stack, including tracing, telemetry, logging, health monitoring, visualization, and dashboards
Designing and implementing a robust observability framework using composable open source solutions like Prometheus, Alertmanager, OpenTelemetry, Grafana, Alloy, Loki, Promtail, Tempo, Thanos, ELK stack, Zabbix, and similar
Developing and maintaining health monitoring and alerting systems for compute platforms, databases, network infrastructure, and Kubernetes-based platforms including GPU-supported environments
Creating and managing visualization dashboards to monitor system performance, resource utilization, and operational health
Implementing scalable, distributed logging and tracing solutions to diagnose, troubleshoot, and resolve system issues effectively
Collaborating with development and operations teams to integrate observability practices into the development lifecycle
Conducting performance analysis and optimization to ensure system reliability and efficiency
Staying updated with the latest trends and technologies in observability and performance monitoring
Collaborating with cross-functional teams (Cloud Engineering, Network, and DevOps/Solutions Engineering) to troubleshoot and resolve infrastructure issues
Bachelor's or Master's degree in Computer Science, Information Technology, or a related field

Preferred

Proven experience in observability, system and network monitoring, and system performance analysis, particularly in a cloud or data center environment
Expertise in implementing and managing observability tools and technologies such as composable open source solutions like Prometheus, Alertmanager, OpenTelemetry, Grafana, Alloy, Loki, Promtail, Tempo, Thanos, ELK stack, Zabbix, and similar commercial solutions
Hands-on experience with Kubernetes
Experience with infrastructure-as-code and configuration management tools such as Consul, GitHub, Salt Stack, Terraform, etc
Proficiency in scripting and automation using languages such as Go, Python, Shell
Excellent problem-solving skills and the ability to work independently or as part of a team
Strong communication skills and the ability to work in a fast-paced, dynamic environment

Benefits

Health
Dental
401k plan
Tuition and professional development reimbursement plan

Company

SS&C Technologies

company-logo
SS&C is a global provider of investment and financial software-enabled services and software focused exclusively on the global financial

H1B Sponsorship

SS&C Technologies has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (105)
2024 (59)
2023 (74)
2022 (35)
2021 (30)
2020 (21)

Funding

Current Stage
Public Company
Total Funding
$750M
2024-05-02Post Ipo Debt· $750M
2005-07-27Acquired
1996-05-31IPO

Leadership Team

leader-logo
Anthony Caiafa
Global CTO / CIO
linkedin
leader-logo
Brian Schell
Executive Vice President and Chief Financial Officer
linkedin
Company data provided by crunchbase