Senior Data Engineer jobs in United States

Curotec · 5 months ago

Senior Data Engineer

Curotec is an IT services company seeking a Senior Data Engineer to support the ingestion, processing, and synchronization of data across its analytics platform. The role focuses on using Python Notebooks to ingest data via APIs into Microsoft Fabric's Data Lake and Data Warehouse, keeping data available and accurate for reporting.

Artificial Intelligence (AI), Cloud Data Services, Cloud Management, Consulting, Enterprise Applications, Information Technology, Outsourcing, SaaS, Software, Software Engineering
Growth Opportunities

Responsibilities

Build and maintain Python Notebooks to ingest data from third-party APIs
Design and implement Medallion layer architecture (Bronze, Silver, Gold) for structured data organization and progressive data refinement
Store and manage data within Microsoft Fabric's Data Lake and Warehouse using the Delta (Parquet) file format
Set up data pipelines and sync key datasets to Azure Synapse Analytics
Develop PySpark-based data transformation processes across Bronze, Silver, and Gold layers
Collaborate with developers, analysts, and stakeholders to ensure data availability and accuracy
Monitor, test, and optimize data flows for reliability and performance
Document processes and contribute to best practices for data ingestion and transformation

Qualifications

Python, PySpark, Medallion architecture, Microsoft Fabric, Azure Synapse Analytics, RESTful APIs, Delta Lake, Data warehousing, Cloud workflows, Data modeling, AI coding tools, Fivetran, Airbyte, Rivery

Required

Strong experience with Python for data ingestion and transformation
Proficiency with PySpark for large-scale data processing
Proficiency in working with RESTful APIs and handling large datasets
Experience with Microsoft Fabric or similar modern data platforms
Understanding of Medallion architecture (Bronze, Silver, Gold layers) and data lakehouse concepts
Experience working with Delta Lake and Parquet file formats
Understanding of data warehousing concepts and performance tuning
Familiarity with cloud-based workflows, especially within the Azure ecosystem

Preferred

Experience with marketing APIs such as Google Ads or Google Analytics 4
Familiarity with Azure Synapse and Data Factory pipeline design
Understanding of data modeling for analytics and reporting use cases
Experience with AI coding tools
Experience with Fivetran, Airbyte, and Rivery

Company

Curotec

Latin American Nearshore Software Development & Staff Augmentation Powerhouse