ServiceLink · 3 hours ago
Data Engineering Lead
ServiceLink is building a modern Azure lakehouse platform for natural language analytics. The Data Engineering Lead will architect and build this platform while mentoring a team of engineers, focusing on data platform architecture and production-grade ELT/ETL pipelines.
Asset ManagementFinancial ServicesInsurance
Responsibilities
Own the data platform architecture (Azure Data Lake Gen2, Delta Lake, Lakehouse) and build production‑grade ELT/ETL pipelines (PySpark, SQL, Python)
Implement a semantic layer/metrics store to enable natural language → SQL/metric translation and consistent KPI definitions across the business
Design and operate real‑time and batch pipelines using ADF/Synapse/Databricks Workflows; implement medallion architecture, schema evolution, and data contracts
Build the retrieval layer for LLMs (embeddings, metadata, grounding context) using Azure OpenAI + Azure AI Search (or vectorized Delta tables) to support chat‑based analytics
Implement data quality, lineage, and observability (e.g., Great Expectations, Unity Catalog/Purview), plus cost governance (partitioning, Z‑order, compaction)
Deliver automated anomaly detection and alerting (time‑series baselines, isolation forests, Azure ML pipelines, Event Grid/Functions)
Partner with product/ops leaders to translate vague analytical questions into robust data models, metrics, and queries with clear SLAs
Lead, mentor, and uplevel a team of data & Python engineers; establish patterns, reviews, and documentation; own CI/CD and IaC (Bicep/Terraform)
Drive security, privacy, and compliance by design (RBAC, least privilege, PII handling, encryption, auditability)
Qualification
Required
7–10+ years in data engineering; 2–4+ years leading small teams while staying hands‑on (50–70%)
Expert in Azure Data Lake Gen2, Delta Lake, Unity Catalog (or Fabric equivalent), PySpark, SQL, and Python
Proven experience designing Lakehouse/medallion architectures, incremental loads, MERGE/UPSERT patterns, schema evolution
Strong command of Databricks (or Fabric Lakehouse), ADF/Synapse/Databricks Workflows, and monitoring/observability
Built or contributed to a semantic/metrics layer and query optimization for complex, multi‑join analytics
Practical experience with Azure OpenAI integrations, retrieval/RAG, embeddings, vector search, and grounding structured data
CI/CD for data (GitHub Actions/Azure DevOps), IaC (Bicep/Terraform), testing frameworks for pipelines, data contracts
Excellent communication; able to translate business questions into data models and mentor engineers
Preferred
Azure ML pipelines; time‑series forecasting; root cause analysis frameworks
Great Expectations/Monte Carlo; OpenLineage; Purview; Fabric Semantic Models
Event‑driven patterns (Event Grid/Service Bus), streaming (Kafka/Event Hubs)
Experience in operations/financial services/valuations domains
Company
ServiceLink
ServiceLink offers asset management, insurance, and mortgage loan services. It is a sub-organization of Fidelity National Financial.
Funding
Current Stage
Late StageRecent News
2025-12-13
HousingWire.com
2025-12-12
2025-11-28
Company data provided by crunchbase