Observability Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Novia Infotech · 23 hours ago

Observability Engineer

Novia Infotech is seeking a Mid-Level Observability Engineer to help implement, operate, and continuously improve observability capabilities across applications and platforms. This hands-on role focuses on onboarding, instrumentation, dashboarding, alerting, and automation while collaborating with application, SRE, and operations teams to ensure systems are observable and production ready.

Information Technology & Services
Hiring Manager
Chandan Dixit
linkedin

Responsibilities

Implement and maintain metrics, logs, and traces for applications and infrastructure
Onboard applications into observability platforms such as Dynatrace, ELK/Elastic, Datadog, or New Relic
Configure dashboards, alerts, and basic anomaly detection
Enable structured logging, core metrics, and basic distributed tracing with development teams
Validate observability requirements during Production Readiness Reviews (PRR)
Troubleshoot missing, incorrect, or low-quality telemetry
Configure alerts based on golden signals (latency, errors, traffic, saturation)
Reduce alert noise by tuning thresholds and alert logic
Support incident response by analyzing logs, metrics, and traces
Perform root cause analysis using observability tools
Maintain dashboards and documentation for on-call and support teams
Participate in on-call rotations as required
Automate observability onboarding and validation processes
Create reusable dashboards and alert templates
Follow established observability standards and best practices

Qualification

Observability platformsMetricsLogsAutomation scriptingSLIsSLOsCloud platformsKubernetesIncident management toolsOpenTelemetry

Required

2–4 years of experience in Observability or Site Reliability Engineering (SRE)
Strong understanding of metrics, logs, and basic tracing concepts
Hands-on experience with at least one observability platform (Dynatrace, ELK/Elastic, Datadog, New Relic, etc.)
Basic understanding of SLIs, SLOs, and service health indicators
Experience working with cloud platforms or hybrid environments
Ability to write automation and troubleshooting scripts using Python, Bash, or PowerShell

Preferred

Experience with OpenTelemetry or APM agents
Familiarity with Kubernetes and containerized workloads
Experience with incident management tools such as PagerDuty or ServiceNow
Exposure to Dynatrace, Kibana, ELK, or other cloud-native monitoring tools
Experience working in regulated or large enterprise environments

Company

Novia Infotech

twitter
company-logo
At Novia Infotech, we’re more than just a service provider — we’re your partner in professional growth.

Funding

Current Stage
Growth Stage
Company data provided by crunchbase