Wolf Greenfield · 2 weeks ago
Data Science Engineer
Wolf Greenfield is seeking a versatile Data Science Engineer to join their team. The role involves designing, building, and operationalizing advanced data-driven solutions that support business decision-making and product innovation, with a focus on data lifecycle management, machine learning model development, and collaboration with cross-functional teams.
AdviceCommunitiesLegal
Responsibilities
Uses Data Pipeline Development to design, build, and maintain enterprise-scale ETL/ELT pipelines using Azure Data Factory and Fabric Data Factory
Leverages Microsoft Fabric (OneLake, Lakehouse, and Warehouse) to unify disparate data sources for downstream science workloads to build and optimize data workflows
Uses Model Engineering to develop, train, and tune machine learning models using Synapse Data Science (Notebooks) and MLflow while monitoring performance, data drift, and system reliability
Uses Feature Engineering to transform raw data into curated datasets using PySpark and SQL to optimize model performance
Implements CI/CD patterns for machine learning, ensuring models are versioned, monitored, and easily redeployed
Implements data quality checks, monitoring, and validation processes to ensure data integrity
Performs exploratory data analysis to uncover trends, patterns, and actionable insights
Translates business data into quantitative frameworks and measurable outcomes
Communicates findings and model performance for both technical and non-technical stakeholders
Optimizes query performance and data processing workflows for efficiency and cost-effectiveness
Creates and maintains technical documentation for data pipelines, models, and processes
Creates, updates, and secures Fabric items, specifically Lakehouses, Warehouses, Notebooks, and Dataflows Gen2 within the OneLake platform
Manages data orchestration using advanced knowledge of Azure Data Factory pipelines, activities, triggers, and Self-hosted Integration Runtimes
Utilizes a strong command of Python (Pandas, Scikit-learn, PySpark) and SQL to create dataflows and notebooks for ingestion and analytics
Implements Star Schema and Medallion Architecture (Bronze/Silver/Gold) principles to ensure data scalability
Participates in code reviews and contributes to the evolution of best practices for the team
Qualification
Required
Minimum of three (3+) of work experience as a data science engineer with a proven track record
Proven hands-on experience with the Azure Data Stack (ADLS Gen2, Azure SQL, Key Vault)
Proficiency in SQL, Python, and PySpark
Experience with data pipeline development and AI/ML applications
Knowledge of data warehousing, data lake architecture, and Microsoft Fabric platform
Familiarity with machine learning, AI, and Generative AI technologies
Strong analytical, problem-solving, and communication skills
Ability to work collaboratively in a fast-paced, cross-functional environment
Preferred
Bachelor's degree or equivalent in Computer Science or a related field preferred
Experience with Power BI for visualizing model outputs and creating user dashboards
Familiarity with Azure DevOps or GitHub Actions for automated deployments
Microsoft Certifications (e.g., DP-600: Fabric Analytics Engineer or DP-700: Fabric Data Engineer)
Benefits
Multiple health care plan options
Vision and dental insurance
Flexible spending accounts/health saving accounts
Life insurance
Employer sponsored 401(k) plan includes an employer match and discretionary profit sharing
Tuition reimbursement
Professional development opportunities
Generous paid time off
Sick time and vacation time
Parental leave
Commuter benefits
Charitable matching gift program
Well-being support
Company
Wolf Greenfield
Wolf Greenfield is the largest Boston law firm devoted exclusively to intellectual property law.
Funding
Current Stage
Growth StageLeadership Team
Recent News
GlobeNewswire
2026-01-09
2025-01-02
Company data provided by crunchbase