Datadog Tester / Observability QA Engineer jobs in United States
info-icon
This job has closed.
company-logo

OVA.Work ยท 2 weeks ago

Datadog Tester / Observability QA Engineer

OVA.Work is seeking a Datadog Tester / Observability QA Engineer responsible for validating monitoring, logging, tracing, and alerting implementations using Datadog. The role ensures accurate observability, correct alerting, and reliable dashboards across applications, APIs, and infrastructure.

ComplianceHuman ResourcesSoftware

Responsibilities

Validate application, API, and infrastructure metrics collected in Datadog
Ensure correct host, container, and service-level metrics
Verify metric thresholds, anomalies, and aggregation accuracy
Validate log ingestion from applications, APIs, servers, and cloud services
Ensure log parsing, indexing, tagging, and retention policies
Verify log search, filters, and correlation with metrics
Test distributed tracing for APIs and microservices
Validate service maps, latency, error rates, and throughput
Ensure trace-to-log and trace-to-metric correlation
Test alerts, monitors, and notifications (Email, Slack, PagerDuty, etc.)
Validate alert conditions, severity levels, and escalation rules
Perform negative testing to avoid false positives/negatives
Validate Datadog dashboards for accuracy and usability
Ensure correct widgets, queries, and time-series data
Verify role-based access and dashboard sharing
Test Datadog integrations with cloud providers (AWS/Azure/GCP), Kubernetes, Docker, databases, and CI/CD tools
Validate environment separation (Dev, QA, Stage, Prod)
Work with Dev, DevOps, SRE, and Product teams
Report observability gaps and monitoring defects
Assist in production issue root cause analysis (RCA)

Qualification

DatadogMonitoring TestingLog ManagementAPM & TracingCloud ProvidersKubernetesDockerCollaboration

Required

Experience with Datadog for monitoring, logging, tracing, and alerting implementations
Ability to validate application, API, and infrastructure metrics collected in Datadog
Experience ensuring correct host, container, and service-level metrics
Ability to verify metric thresholds, anomalies, and aggregation accuracy
Experience validating log ingestion from applications, APIs, servers, and cloud services
Ability to ensure log parsing, indexing, tagging, and retention policies
Experience verifying log search, filters, and correlation with metrics
Ability to test distributed tracing for APIs and microservices
Experience validating service maps, latency, error rates, and throughput
Ability to ensure trace-to-log and trace-to-metric correlation
Experience testing alerts, monitors, and notifications (Email, Slack, PagerDuty, etc.)
Ability to validate alert conditions, severity levels, and escalation rules
Experience performing negative testing to avoid false positives/negatives
Ability to validate Datadog dashboards for accuracy and usability
Experience ensuring correct widgets, queries, and time-series data
Ability to verify role-based access and dashboard sharing
Experience testing Datadog integrations with cloud providers (AWS/Azure/GCP), Kubernetes, Docker, databases, and CI/CD tools
Ability to validate environment separation (Dev, QA, Stage, Prod)
Experience collaborating with Dev, DevOps, SRE, and Product teams
Ability to report observability gaps and monitoring defects
Experience assisting in production issue root cause analysis (RCA)

Company

OVA.Work

twittertwitter
company-logo
OVA is the most advanced Automated, Intelligent, intuitive On-boarding platform for Staffing Firms of all sizes.

Funding

Current Stage
Growth Stage
Total Funding
unknown
2020-03-15Pre Seed
Company data provided by crunchbase