Dayforce ยท 1 week ago
Data Reliability Engineer Sr
Wonder how qualified you are to the job?
Software Development
Insider Connection @Dayforce
Responsibilities
Engineering solutions to improve the reliability of data and data processes
Data lifecycle procedures (e.g., when and how data gets deprecated)
Data SLI, SLA, SLO definition and documentation
Data observability strategy and implementation
Data pipeline code review, and testing framework
Data outage triage and response process
Data ownership strategy and documentation
Education and culture-building (e.g., internal roadshow to explain data SLAs)
Developing guardrails around data processes
Automating data related processes in the infrastructure to constantly remove toil
Monitoring costs of data activities (pipelines, storage, compute, network, etc.)
Track the lineage of the data
Perform change management when data tooling changes
Ensure cross-team communication regarding data activities
Ensure PII (Personal Identifiable Information) is properly handled in the data ecosystem
Ensure the business is compliant with all regulations regarding data (i.e., GDPR, etc.)
Ensure that the Machine Learning models are versioned, reproducible, evaluated, monitored, and comply with overall software engineering best practices
Qualification
Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.
Required
7+ years relevant experience creating, reviewing, debugging, costing, and maintaining data pipelines in both public and private clouds
Fluent writing and reviewing code in at least one of those languages: Java, Scala, or C#
Experience in scripting languages (bash, Python) for automating processes
Experience with pipeline orchestration tools, such as Airflow or Dagster
Experience with Infrastructure-as-code technologies such as Kubernetes, Terraform, Docker
Experience with data streaming technologies such as Kafka, Flink
Experience of BI, Data Warehousing and Data Lake
Good experience with monitoring & observability tools, such as AppDynamics, Datadog, Honeycomb with proven ability to troubleshoot and problem solve in complex systems
Experience with Machine Learning lifecycle from modeling to production
Demonstrated ability to quickly adapt, learn new skill sets, and be able to understand operational challenges
Strong analytical, problem-solving, negotiation and organizational skills with a clear focus under pressure
Resourceful, results orientated with the ability to get things done and overcome obstacles
Excellent interpersonal skills, including relationship building with diverse, global, cross-functional team
Excellent written and verbal communication in English
Highly motivated and energetic personality capable of influencing positive change and growth
Ability to obtain or currently possess Canadian Protected B security clearance (resident in Canada 5 years or longer, clean record)
Preferred
Azure Administrator and Architect certifications a plus