Data Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Randstad Digital Americas · 22 hours ago

Data Engineer

Randstad Digital Americas is seeking a Data Engineer to support the Middle Office Digital Activity team. The role involves building, enhancing, and maintaining scalable data products and pipelines on AWS, focusing on transforming and delivering reliable datasets using SQL, Python, and PySpark.

Information Technology & Services

Responsibilities

The contractor will support the Middle Office Digital Activity team by building, enhancing, and maintaining scalable data products and pipelines on AWS
The role focuses on transforming and delivering reliable datasets using SQL, Python, and PySpark, with AWS Glue as the primary ETL tool
Robust proficiency in writing complex, performant queries
Experience with analytical datasets and data transformations
Solid experience building data pipelines and reusable modules
Familiarity with data processing libraries and structured code practices
Hands-on experience developing PySpark jobs for large-scale data processing
Understanding of distributed data processing concepts
AWS Glue (primary ETL tool - required)
Experience working with S3-based data lakes
Familiarity with IAM, job scheduling, and monitoring in AWS
ETL/ELT design patterns, Data Modelling Techniques
Data quality checks and basic validation frameworks
Understanding of partitioning, schema evolution, and performance optimization
Experience with Redshift or any other MPP database
Knowledge of Iceberg, or other open table formats
Exposure to CI/CD for data pipelines
Experience supporting event-based design and systems preferably Kafka
Familiarity with data governance, lineage, or cataloging tools

Qualification

SQLPythonPySparkAWS GlueData Engineering FundamentalsRedshiftCI/CDKafkaData Governance

Required

Robust proficiency in writing complex, performant queries
Experience with analytical datasets and data transformations
Solid experience building data pipelines and reusable modules
Familiarity with data processing libraries and structured code practices
Hands-on experience developing PySpark jobs for large-scale data processing
Understanding of distributed data processing concepts
AWS Glue (primary ETL tool - required)
Experience working with S3-based data lakes
Familiarity with IAM, job scheduling, and monitoring in AWS
ETL/ELT design patterns, Data Modelling Techniques
Data quality checks and basic validation frameworks
Understanding of partitioning, schema evolution, and performance optimization

Preferred

Experience with Redshift or any other MPP database
Knowledge of Iceberg, or other open table formats
Exposure to CI/CD for data pipelines
Experience supporting event-based design and systems preferably Kafka
Familiarity with data governance, lineage, or cataloging tools

Benefits

Medical
Prescription
Dental
Vision
AD&D
Life insurance offerings
Short-term disability
401K plan

Company

Randstad Digital Americas

twitter
company-logo
Randstad Digital is a trusted digital enablement partner that facilitates accelerated transformation for businesses by providing global talent, capacity, and solutions across specialized domains.

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Graig Paglieri
CEO, Randstad Digital Americas
linkedin
leader-logo
Pascal de Hesselle
SVP, Executive Client Partner - Technology, Media & Telecom
linkedin
Company data provided by crunchbase