Complete Staffing Solutions · 3 days ago
Data Science Engineer
Complete Staffing Solutions is seeking a versatile Data Science Engineer to join their team. The role involves designing, building, and operationalizing advanced data-driven solutions to support business decision-making and product innovation, while collaborating with cross-functional teams to ensure a robust data infrastructure.
CRMHuman ResourcesInformation TechnologyStaffing Agency
Responsibilities
Uses Data Pipeline Development to design, build, and maintain enterprise-scale ETL/ELT pipelines using Azure Data Factory and Fabric Data Factory
Leverages Microsoft Fabric (OneLake, Lakehouse, and Warehouse) to unify disparate data sources for downstream science workloads to build and optimize data workflows
Uses Model Engineering to develop, train, and tune machine learning models using Synapse Data Science (Notebooks) and MLflow while monitoring performance, data drift, and system reliability
Uses Feature Engineering to transform raw data into curated datasets using PySpark and SQL to optimize model performance
Implements CI/CD patterns for machine learning, ensuring models are versioned, monitored, and easily redeployed
Implements data quality checks, monitoring, and validation processes to ensure data integrity
Performs exploratory data analysis to uncover trends, patterns, and actionable insights
Translates business data into quantitative frameworks and measurable outcomes
Communicates findings and model performance for both technical and non-technical stakeholders
Optimizes query performance and data processing workflows for efficiency and cost-effectiveness
Creates and maintains technical documentation for data pipelines, models, and processes
Fabric Management: Creates, updates, and secures Fabric items, specifically Lakehouses, Warehouses, Notebooks, and Dataflows Gen2 within the OneLake platform
Advanced Orchestration: Manages data orchestration using advanced knowledge of Azure Data Factory pipelines, activities, triggers, and Self-hosted Integration Runtimes
Data Manipulation: Utilizes a strong command of Python (Pandas, Scikit-learn, PySpark) and SQL to create dataflows and notebooks for ingestion and analytics
Architecture Best Practices: Implements Star Schema and Medallion Architecture (Bronze/Silver/Gold) principles to ensure data scalability
Quality Assurance: Participates in code reviews and contributes to the evolution of best practices for the team
Qualification
Required
Bachelor's degree or equivalent in Computer Science or a related field preferred
Minimum of three (3+) of work experience as a data science engineer with a proven track record
Proven hands-on experience with the Azure Data Stack (ADLS Gen2, Azure SQL, Key Vault)
Experience with Power BI for visualizing model outputs and creating user dashboards
Proficiency in SQL, Python, and PySpark
Experience with data pipeline development and AI/ML applications
Knowledge of data warehousing, data lake architecture, and Microsoft Fabric platform
Familiarity with machine learning, AI, and Generative AI technologies
Strong analytical, problem-solving, and communication skills
Ability to work collaboratively in a fast-paced, cross-functional environment
Familiarity with Azure DevOps or GitHub Actions for automated deployments
Preferred
Microsoft Certifications (e.g., DP-600: Fabric Analytics Engineer or DP-700: Fabric Data Engineer)
Company
Complete Staffing Solutions
Complete Staffing Solutions is a staffing and recruiting company that offers finance, accounting, and administrative staffing solutions.