This job has closed.

INSPYR Solutions · 3 months ago

Data & Observability Architect

INSPYR Solutions is a national expert in delivering flexible technology and talent solutions. They are seeking a Data & Observability Architect to define and lead the strategy for collecting, storing, and serving observability data across the engineering organization, ensuring actionable insights from diverse telemetry sources.

Information Technology · Professional Services · Staffing Agency
No H1B · U.S. Citizen Only

Responsibilities

Define and maintain a unified observability taxonomy across metrics, logs, and traces, and help design a traceable observability platform
Design and implement ingestion → storage → retrieval pipelines with automation for large-scale observability data with tiered retention (hot/warm/cold)
Architect observability across all infrastructure layers (DC, network, storage, compute, Kubernetes, HPC, apps), with multi-tenancy
Establish tech stack standards (e.g., VictoriaMetrics, Loki, Tempo, OpenTelemetry, Coralogix) for different observability signals
Help build persona-oriented views for Finance, Operations, Executives, Developers, Platform, etc.
Build and guide transparency around cost, observability and resiliency of the observability platform
Define and enforce data governance for telemetry (label taxonomy, cardinality budgets, PII handling etc.)
Partner with Platform, Security, and Solution Architecture teams to ensure observability onboarding integrates with compliance, incident response, and developer workflows
Coach engineering teams on OpenTelemetry instrumentation and best practices for emitting metrics/logs/traces

Qualifications

Observability platforms · Telemetry pipelines · SRE principles · Data governance · Communication skills

Required

Strong expertise in observability platforms: Prometheus/VictoriaMetrics, Grafana, Loki/ELK, Tempo/Jaeger, OpenTelemetry
Experience designing large-scale telemetry pipelines with ingestion, retention, and query optimization
Experience reducing MTTD/MTTR by implementing detective, preventive and proactive monitoring/controls
Familiarity with SRE principles: SLIs, SLOs, error budgets, burn-rate alerting
Knowledge of data governance in observability contexts (taxonomy, labeling, cardinality control, PII redaction)
Hands-on skills with data pipelines (Kafka, Fluent Bit, Vector, Airflow) and object storage for archival
Strong communication and documentation skills to serve diverse stakeholders (finance, ops, exec, dev)

Preferred

12+ years in SRE, Platform Engineering, or Data Engineering with a focus on observability
Proven track record in building enterprise-wide observability strategies for hybrid/on-prem + cloud environments
Experience with high-performance computing (HPC) telemetry and schedulers (Slurm, LSF)
Experience with Resilience and Business Continuity
Exposure to multi-cloud observability integration (e.g., AWS CloudWatch, Azure Monitor) alongside on-prem stacks
Familiarity with cost modeling and chargeback tied to resource telemetry
Prior leadership in designing persona-based observability views for technical and business consumers

Company

INSPYR Solutions

INSPYR Solutions is an information technology staffing service provider.

Funding

Current Stage
Late Stage

Leadership Team

Gregg Straus
Executive Vice President & Chief Financial Officer
Michelle Wren
Chief Operating Officer
Company data provided by Crunchbase