Fusemachines · 4 hours ago
Senior Data Engineer/ Tech Lead
Fusemachines is a global provider of enterprise AI products and services, on a mission to democratize AI. They are seeking a Senior Data Engineer and Technical Lead to modernize legacy systems into a cloud-native architecture while training and mentoring the engineering team on Azure best practices.
Artificial Intelligence (AI)Big DataMachine LearningSoftware
Responsibilities
Serve as the Lead Engineer to design and implement scalable, hybrid data solutions that bridge legacy on-premises databases with modern Azure cloud-native layers
Actively train and mentor junior engineers, fostering a culture of technical excellence and ensuring the team adheres to Azure and Fabric best practices
Develop and maintain Event-Driven architectures using CDC and Spark Streaming to achieve sub-2-second data propagation for time-sensitive financial reporting
Architect Incremental Data Sync (Delta-based) workflows to replace high-latency full refreshes, significantly reducing operational risk and API usage
Design and govern a GraphQL Access Layer and Azure Functions to mask legacy system limitations and provide conditioned, validated data to frontend applications
Implement a robust Data Quality Framework using Python and SQL to identify anomalies and ensure data integrity across the Lakehouse and Warehouse layers
Manage and fine-tune Azure/Fabric resources, including Spark Pools, SKUs, and Capacity settings, to ensure high performance during peak times while maintaining cost-efficiency
Build and enforce proactive monitoring frameworks utilizing Power BI dashboards and Microsoft Teams notifications for real-time failure alerts
Work closely with cross-functional stakeholders and third-party vendors to align API contracts and business logic for seamless integration with Salesforce or other applications
Qualification
Required
5+ years of hands-on data engineering experience with deep expertise in the Azure ecosystem
Proven experience with Microsoft Fabric, specifically Eventstreams, Lakehouses (Delta over Blob/Data Lake Storage), Fabric Notebooks, Data Factory, and GraphQL
Deep knowledge of Change Data Capture (CDC) and Transaction Replication from on-premises SQL Server 2012 to the cloud
Expertise in Spark Streaming and PySpark for real-time data processing
Hands-on experience with GraphQL in Fabric and Azure Functions (acting as proxies/API bridges)
Proficiency in Azure API Management (APIM) for governing secure interfaces and managing throttling/quotas
Experience integrating complex third-party platforms like Salesforce and low-code/no-code applications via API-led connectivity
Advanced SQL (stored procedures, window functions) and Python/PySpark for optimized data processing
Strong background in ETL/ELT orchestration using Azure Data Factory and Airflow
Strong experience with PowerBI
Deep understanding of SDLC/Agile and Azure DevOps for CI/CD and artifact management
Knowledge of Azure security best practices (AD, NSG, encryption) and government compliance standards
Preferred
Azure Data Engineer Associate
Fabric Data Engineer Associate
Azure Solutions Architect Expert
Company
Fusemachines
Fusemachines is an enterprise AI services and solutions provider that brings AI education, products, and jobs to underserved communities.
H1B Sponsorship
Fusemachines has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2024 (1)
2021 (3)
2020 (13)
Funding
Current Stage
Public CompanyTotal Funding
$9.87MKey Investors
Consilium Investment ManagementBusiness Oxygen (BO2)
2025-12-23Post Ipo Equity· $1M
2025-10-23IPO
2022-01-14Private Equity· $1M
Recent News
legacy.thefly.com
2026-02-05
2026-02-02
2026-02-02
Company data provided by crunchbase