Yassir · 3 months ago
Mid/Senior Data Engineer
Yassir is the leading super App in the Maghreb region, transforming daily services and expanding into financial services to foster a digital economy. The Mid/Senior Data Engineer will be responsible for building a centralized data lake, developing data processing pipelines, and collaborating with cross-functional teams to extract insights from data.
Financial ServicesInformation TechnologyMobile AppsTransportation
Responsibilities
Build a centralized data lake on GCP Data services by integrating diverse data sources throughout the enterprise
Develop, maintain, and optimize SPARK-powered batch and streaming data processing pipelines. Leverage GCP data services for complex data engineering tasks and ensure smooth integration with other platform components
Design and implement data validation and quality checks to ensure data's accuracy, completeness, and consistency as it flows through the pipelines
Work with the Data Science and Machine Learning teams to engage in advanced analytics
Collaborate with cross-functional teams, including data analysts, business users, operational and marketing teams, to extract insights and value from data
Collaborate with the product team to design, implement, and maintain the data models for analytical use cases
Design, develop, and upkeep data dashboards for various teams using Looker Studio
Engage in technology explorations, research and development, POC’s and conduct deep investigations and troubleshooting
Design and manage ETL/ELT processes, ensuring data integrity, availability, and performance
Troubleshoot data issues and conduct root cause analysis when reporting data is in question
Qualification
Required
Build a centralized data lake on GCP Data services by integrating diverse data sources throughout the enterprise
Develop, maintain, and optimize SPARK-powered batch and streaming data processing pipelines
Leverage GCP data services for complex data engineering tasks and ensure smooth integration with other platform components
Design and implement data validation and quality checks to ensure data's accuracy, completeness, and consistency as it flows through the pipelines
Work with the Data Science and Machine Learning teams to engage in advanced analytics
Collaborate with cross-functional teams, including data analysts, business users, operational and marketing teams, to extract insights and value from data
Collaborate with the product team to design, implement, and maintain the data models for analytical use cases
Design, develop, and upkeep data dashboards for various teams using Looker Studio
Engage in technology explorations, research and development, POC's and conduct deep investigations and troubleshooting
Design and manage ETL/ELT processes, ensuring data integrity, availability, and performance
Troubleshoot data issues and conduct root cause analysis when reporting data is in question
PySpark - Batch and Streaming
GCP - Dataproc, Dataflow, DataStream, Dataplex, Pub/Sub, BigQuery and Cloud Storage
NoSQL (preferably MongoDB)
Programming languages: Scala/Python
Great Expectation, or similar DQ framework
Familiarity with workflow management tools like: Airflow, Prefect or Luigi
Understanding of Data Governance, Data Warehousing and Data Modelling
Good SQL knowledge
Able to communicate effectively, distill technical knowledge into digestible messages in a succinct / visual way
Proactively identify and contribute with team development initiatives, and supporting junior members
Preferred
Infrastructure-as-Code, preferably Terraform
Docker and Kubernetes
Looker
AI / ML engineering knowledge
Lineage, or relevant tools e.g. Atlan
DBT
Company
Yassir
The leading super app offers on-demand services, ride-hailing, delivery, and payment. It's revolutionizing how daily services are provided.
Funding
Current Stage
Late StageTotal Funding
$180MKey Investors
BondWndrCoUnpopular Ventures
2022-11-07Series B· $150M
2021-11-29Series A· $30M
2021-03-21Seed
Recent News
2026-01-11
2026-01-09
Fintechnews Middle East
2025-12-16
Company data provided by crunchbase