eTeam · 8 hours ago

Senior Data Engineer (PySpark / AWS Big Data)

Richardson, TX

Full-time

Onsite

Senior Level, Lead/Staff

8+ years of experience

eTeam is seeking a Senior Data Engineer with expertise in PySpark and AWS Big Data. The role involves designing and building robust ETL/ELT pipelines, optimizing data processing, and ensuring data quality and consistency across ingestion flows.

Information Technology

Growth Opportunities

H1B Sponsor Likely

Responsibilities

Design and build robust, scalable ETL/ELT pipelines using PySpark to ingest data from diverse sources (databases, logs, APIs, files)

Transform and curate raw transactional and log data into analysis-ready datasets in the Data Hub and analytical data marts

Develop reusable and parameterized Spark jobs for batch and micro-batch processing

Optimize performance and scalability of PySpark jobs across large data volumes

Ensure data quality, consistency, lineage, and proper documentation across ingestion flows

Collaborate with Data Architects, Modelers, and Data Scientists to implement ingestion logic aligned with business needs

Work with cloud-based data platforms (e.g., AWS S3, Glue, EMR, Redshift) for data movement and storage

Support version control, CI/CD, and infrastructure-as-code where applicable

Participate in Agile ceremonies and contribute to sprint planning, story grooming, and demos

Qualifications

PySpark, AWS, Data Engineering, SQL, Data Pipelines, Data Lake/Warehouse, Python, Data Governance, DevOps, Agile

Required

4+ years of experience in data engineering, with strong focus on PySpark/Spark for big data processing

Expertise in building data pipelines and ingestion frameworks from relational, semi-structured (JSON, XML), and unstructured sources (logs, PDFs)

Proficiency in Python with strong knowledge of data processing libraries

Strong SQL skills for querying and validating data in platforms like Amazon Redshift, PostgreSQL, or similar

Experience with distributed computing frameworks (e.g., Spark on EMR, Databricks)

Familiarity with workflow orchestration tools (e.g., AWS Step Functions or similar)

Solid understanding of data lake / data warehouse architectures and data modeling basics

Minimum Years of Experience: 8-15+

Certifications Needed: Yes (AWS)

Preferred

Experience with AWS data services: Glue, S3, Redshift, Lambda, CloudWatch, etc.

Familiarity with Delta Lake or similar for large-scale data storage

Exposure to real-time streaming frameworks (e.g., Spark Structured Streaming, Kafka)

Knowledge of data governance, lineage, and cataloging tools (e.g., AWS Glue Catalog, Apache Atlas)

Understanding of DevOps/CI-CD pipelines for data projects using Git, Jenkins, or similar tools

Company

eTeam

Glassdoor 4.0

eTeam is a staffing agency that also provides payrolling services.

Founded in 1999

Somerset, New Jersey, USA

501-1000 employees

http://www.eteaminc.com

H1B Sponsorship

eTeam has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)

[Chart: Distribution of Different Job Fields Receiving Sponsorship]

Trends of Total Sponsorships

2025 (36)

2024 (205)

2023 (11)

2022 (7)

2021 (24)

2020 (25)

Funding

Current Stage

Late Stage

Total Funding

unknown

2023-12-04: Acquired

Leadership Team

Swetta Bhatt

CEO APAC & India

Aanchal Thakur

Chief Customer Officer

Recent News

PR Newswire

Monument Consulting Announces 2025 Strategic Suppliers

2025-11-19

EIN Presswire

Cynthia Schminke Recognized by Influential Women in 2025

2025-08-21

Team, Inc.

Team, Inc. Announces Executive Promotion to Lead and Accelerate Transformation Effort

2025-07-25

Company data provided by Crunchbase