Walmart Canada · 3 months ago
Distinguished, Software Engineer - Observability
Walmart Inc. is a leading retail corporation, and they are seeking a Distinguished Software Engineer specializing in Observability. The role involves being a technical lead in the development of cloud-native observability designs, managing real-time telemetry software systems, and collaborating with various stakeholders to implement telemetry R&D projects.
DeliveryRetailShopping
Responsibilities
Be a key researcher and technical lead expert in the architecture and development of cloud native observability designs, managed services, and real-time telemetry software systems
Create visionary software architectures and telemetry systems to achieve an observability software product portfolio
Design, develop and implement large-scale distributed systems that process large volumes of data focusing on scalability, latency, and fault-tolerance in every system built
Effectively communicate and build collaboration at all areas and levels of the business and engineering
Utilize multiple telemetry technologies such as: data models, metric libraries, data logging, distributed tracing, datalakes, data correlation, rule based alerting engines, real-time data streaming pipelines, TSDBs, and application performance management (APM)
Create metric software designs and solutions enabling real-time monitoring and alerting of system and application metrics
Lead research initiatives for cloud native designs and implementation within public and private clouds
Utilize TSDBs and correlation and data fusion of multiple data types and heterogenous data streams coupled with Artificial intelligence (AI) and Learned Behaviors for anomaly detection, and forward projections of system and application expected behaviors
Collaborate with enterprise architects, product managers, data scientist, engineers and business managers to bring telemetry R&D projects into production
Use a combination of open source and COTS technologies to solve real-time telemetry problems at an enterprise-wide scale
Lead the design of new systems and the redesign of existing systems to meet business requirements, changing needs, and integration of state-of-the-art technology
Be an evangelist for the Observability foundation socialization technology designs and implementations to engineering and business customers
Qualification
Required
BS/MS in Computer Science, Engineering, or equivalent, with 15+ or more years in software engineering, design and architecture
This role requires a deep understanding of the Java language and associated frameworks and previous development of Java applications, Libs, SDK or services
Strong architecture leadership with demonstrated enterprise level software implementations
Previous demonstrated architectural leadership in research, evaluation, creation of software designs, and distributed software implementations in production
Experience with technical leadership, software roadmaps, research and development, new software initiatives and customer and engineering coordination and engagement
Full stack cloud software development experience
API development, integration, and utilization
Cloud technologies and cloud native designs
Cloud infrastructures and technologies, such as OpenStack, Azure, GCP or AWS
Large scale distributed systems experience including scalability and fault tolerance
TSDBs (InfluxDB, Kairos, Cortex, Thanos, Prometheus) or equivalent
Extract, transform, and load (ETL) processes
Real-time telemetry pipelines and publish/subscribe models (Kafka or equivalent)
Data warehousing, datalakes, processing and data analytics
SQL (AzureSQL, Postgress or equivalent) a solid foundation in advanced SQL
Unix/Linux shell scripting or similar programming/scripting knowledge
Real-time time monitoring and alerting: metric agents, real-time dashboards, alerting rules
Excellent written and verbal communication skills for diverse audiences based on engineering subject matter
Ability to document requirements, architectural designs, and analysis findings in both business and technical terminology
Software development in an Agile iterative CI/CD development environment
Promote and support company policies, procedures, mission, values, and standards of ethics and integrity
Preferred
Knowledge and/or use of agentic AI – Model context protocol (MCP) servers, Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), Natural Language processing (NPL)
Fluency in Python, JavaScript, advanced shell scripting, Configuration management -Ansible, chef, puppet
Experience with Application Performance Monitoring (APM) and/or Distributed Tracing
Deployment of Kubernetes, containers, service meshes, and micro services
Micro services architectures, Istio, and micrometer
Open Telemetry standards and protocols
Go development
Observability tools and system architectures
Experience in creating and maintaining managed metric services
NoSQL (Cassandra, CosmosDB or equivalent)
Storm, Spark or similar real-time streaming software
Knowledge of UI development - JavaScript, HTML, CSS and experience with frameworks like React and AngularJS
Involvement and contribution with open-source software communities
Demonstrated background in developing software systems
Benefits
Health benefits include medical, vision and dental coverage.
Financial benefits include 401(k), stock purchase and company-paid life insurance.
Paid time off benefits include PTO (including sick leave), parental leave, family care leave, bereavement, jury duty, and voting.
Other benefits include short-term and long-term disability, company discounts, Military Leave Pay, adoption and surrogacy expense reimbursement, and more.
You will also receive PTO and/or PPTO that can be used for vacation, sick leave, holidays, or other purposes.
Live Better U is a Walmart-paid education benefit program for full-time and part-time associates in Walmart and Sam's Club facilities. Programs range from high school completion to bachelor's degrees, including English Language Learning and short-form certificates. Tuition, books, and fees are completely paid for by Walmart.
Company
Walmart Canada
Walmart Canada is a subsidiary of Walmart that operates a chain of more than 400 stores nationwide. It is a sub-organization of Walmart.
Funding
Current Stage
Late StageRecent News
Canada NewsWire
2025-12-18
Canada NewsWire
2025-12-03
Company data provided by crunchbase