Data Engineering Lead jobs in United States
info-icon
This job has closed.
company-logo

ServiceLink · 8 hours ago

Data Engineering Lead

ServiceLink is building a modern Azure lakehouse platform for natural language analytics. The Data Engineering Lead will architect and build this platform while mentoring a team of engineers, focusing on data platform architecture and production-grade ELT/ETL pipelines.

Asset ManagementFinancial ServicesInsurance
badNo H1Bnote

Responsibilities

Own the data platform architecture (Azure Data Lake Gen2, Delta Lake, Lakehouse) and build production‑grade ELT/ETL pipelines (PySpark, SQL, Python)
Implement a semantic layer/metrics store to enable natural language → SQL/metric translation and consistent KPI definitions across the business
Design and operate real‑time and batch pipelines using ADF/Synapse/Databricks Workflows; implement medallion architecture, schema evolution, and data contracts
Build the retrieval layer for LLMs (embeddings, metadata, grounding context) using Azure OpenAI + Azure AI Search (or vectorized Delta tables) to support chat‑based analytics
Implement data quality, lineage, and observability (e.g., Great Expectations, Unity Catalog/Purview), plus cost governance (partitioning, Z‑order, compaction)
Deliver automated anomaly detection and alerting (time‑series baselines, isolation forests, Azure ML pipelines, Event Grid/Functions)
Partner with product/ops leaders to translate vague analytical questions into robust data models, metrics, and queries with clear SLAs
Lead, mentor, and uplevel a team of data & Python engineers; establish patterns, reviews, and documentation; own CI/CD and IaC (Bicep/Terraform)
Drive security, privacy, and compliance by design (RBAC, least privilege, PII handling, encryption, auditability)

Qualification

Azure Data Lake Gen2Delta LakePySparkSQLPythonDatabricksADF/SynapseCI/CDIaCCommunication

Required

7–10+ years in data engineering; 2–4+ years leading small teams while staying hands‑on (50–70%)
Expert in Azure Data Lake Gen2, Delta Lake, Unity Catalog (or Fabric equivalent), PySpark, SQL, and Python
Proven experience designing Lakehouse/medallion architectures, incremental loads, MERGE/UPSERT patterns, schema evolution
Strong command of Databricks (or Fabric Lakehouse), ADF/Synapse/Databricks Workflows, and monitoring/observability
Built or contributed to a semantic/metrics layer and query optimization for complex, multi‑join analytics
Practical experience with Azure OpenAI integrations, retrieval/RAG, embeddings, vector search, and grounding structured data
CI/CD for data (GitHub Actions/Azure DevOps), IaC (Bicep/Terraform), testing frameworks for pipelines, data contracts
Excellent communication; able to translate business questions into data models and mentor engineers

Preferred

Azure ML pipelines; time‑series forecasting; root cause analysis frameworks
Great Expectations/Monte Carlo; OpenLineage; Purview; Fabric Semantic Models
Event‑driven patterns (Event Grid/Service Bus), streaming (Kafka/Event Hubs)
Experience in operations/financial services/valuations domains

Company

ServiceLink

company-logo
ServiceLink offers asset management, insurance, and mortgage loan services. It is a sub-organization of Fidelity National Financial.

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Chris Azur
Chief Executive Officer
linkedin
leader-logo
David Holland
Chief Financial Officer
linkedin
Company data provided by crunchbase