Randstad Digital Americas · 15 hours ago
Data Engineer
Randstad Digital Americas is seeking a Data Engineer to support the Middle Office Digital Activity team. The role involves building, enhancing, and maintaining scalable data products and pipelines on AWS, focusing on transforming and delivering reliable datasets using SQL, Python, and PySpark.
Information Technology & Services
Responsibilities
The contractor will support the Middle Office Digital Activity team by building, enhancing, and maintaining scalable data products and pipelines on AWS
The role focuses on transforming and delivering reliable datasets using SQL, Python, and PySpark, with AWS Glue as the primary ETL tool
Robust proficiency in writing complex, performant queries
Experience with analytical datasets and data transformations
Solid experience building data pipelines and reusable modules
Familiarity with data processing libraries and structured code practices
Hands-on experience developing PySpark jobs for large-scale data processing
Understanding of distributed data processing concepts
AWS Glue (primary ETL tool - required)
Experience working with S3-based data lakes
Familiarity with IAM, job scheduling, and monitoring in AWS
ETL/ELT design patterns, Data Modelling Techniques
Data quality checks and basic validation frameworks
Understanding of partitioning, schema evolution, and performance optimization
Experience with Redshift or any other MPP database
Knowledge of Iceberg, or other open table formats
Exposure to CI/CD for data pipelines
Experience supporting event-based design and systems preferably Kafka
Familiarity with data governance, lineage, or cataloging tools
Qualification
Required
Robust proficiency in writing complex, performant queries
Experience with analytical datasets and data transformations
Solid experience building data pipelines and reusable modules
Familiarity with data processing libraries and structured code practices
Hands-on experience developing PySpark jobs for large-scale data processing
Understanding of distributed data processing concepts
AWS Glue (primary ETL tool - required)
Experience working with S3-based data lakes
Familiarity with IAM, job scheduling, and monitoring in AWS
ETL/ELT design patterns, Data Modelling Techniques
Data quality checks and basic validation frameworks
Understanding of partitioning, schema evolution, and performance optimization
Preferred
Experience with Redshift or any other MPP database
Knowledge of Iceberg, or other open table formats
Exposure to CI/CD for data pipelines
Experience supporting event-based design and systems preferably Kafka
Familiarity with data governance, lineage, or cataloging tools
Benefits
Medical
Prescription
Dental
Vision
AD&D
Life insurance offerings
Short-term disability
401K plan
Company
Randstad Digital Americas
Randstad Digital is a trusted digital enablement partner that facilitates accelerated transformation for businesses by providing global talent, capacity, and solutions across specialized domains.