Randstad Digital Americas · 22 hours ago

Data Engineer

Malvern, PA

Full-time

Onsite

Senior Level, Lead/Staff

$51/hr - $56/hr

8+ years exp

Randstad Digital Americas is seeking a Data Engineer to support the Middle Office Digital Activity team. The role involves building, enhancing, and maintaining scalable data products and pipelines on AWS, focusing on transforming and delivering reliable datasets using SQL, Python, and PySpark.

Information Technology & Services

Responsibilities

The contractor will support the Middle Office Digital Activity team by building, enhancing, and maintaining scalable data products and pipelines on AWS

The role focuses on transforming and delivering reliable datasets using SQL, Python, and PySpark, with AWS Glue as the primary ETL tool

Qualifications

SQL, Python, PySpark, AWS Glue, Data Engineering Fundamentals, Redshift, CI/CD, Kafka, Data Governance

Required

Strong proficiency in writing complex, performant SQL queries

Experience with analytical datasets and data transformations

Solid experience building data pipelines and reusable modules

Familiarity with data processing libraries and structured code practices

Hands-on experience developing PySpark jobs for large-scale data processing

Understanding of distributed data processing concepts

AWS Glue (primary ETL tool - required)

Experience working with S3-based data lakes

Familiarity with IAM, job scheduling, and monitoring in AWS

ETL/ELT design patterns and data modeling techniques

Data quality checks and basic validation frameworks

Understanding of partitioning, schema evolution, and performance optimization

Preferred

Experience with Redshift or any other MPP database

Knowledge of Iceberg or other open table formats

Exposure to CI/CD for data pipelines

Experience supporting event-based designs and systems, preferably Kafka

Familiarity with data governance, lineage, or cataloging tools

Benefits

Medical

Prescription

Dental

Vision

AD&D

Life insurance offerings

Short-term disability

401(k) plan

Company

Randstad Digital Americas

Randstad Digital is a trusted digital enablement partner that facilitates accelerated transformation for businesses by providing global talent, capacity, and solutions across specialized domains.

Founded in 1984

Atlanta, GA, US

10,001+ employees

https://www.randstaddigital.com/

Funding

Current Stage

Late Stage

Leadership Team

Graig Paglieri

CEO, Randstad Digital Americas

Pascal de Hesselle

SVP, Executive Client Partner - Technology, Media & Telecom
