Apply on Employer Site

eHub Global Inc · 1 day ago

Data Engineer

NYC Metro Area

Contract

Onsite

Senior Level, Lead/Staff

12+ years exp

eHub Global Inc is seeking a highly skilled Data Engineer with deep expertise in Big Data technologies, data lakes, and modern analytics platforms. The ideal candidate will design, build, and optimize scalable data pipelines that support advanced analytics and business intelligence.

EmploymentStaffing AgencyTraining

Hiring Manager

Divyansh Chaudhary

Responsibilities

Design, develop, and maintain data pipelines for ingesting, transforming, and delivering large-scale datasets

Manage and optimize data lake architectures to ensure scalability, reliability, and performance

Implement and support Hadoop-based solutions for distributed data processing

Integrate and manage Snowflake for cloud-based data warehousing and analytics

Build and maintain real-time streaming solutions using Kafka

Develop and optimize Spark applications for batch and streaming workloads

Collaborate with data analysts, scientists, and business stakeholders to deliver actionable insights

Ensure data quality, governance, and security across all platforms

Monitor and troubleshoot data pipelines to maintain high availability and performance

Qualification

Big Data EcosystemCloud Data WarehousingData PipelinesProgramming & ScriptingData Lake ManagementData AnalysisDistributed SystemsDevOps & AutomationVisualization & BI ToolsProblem-solvingCommunicationCollaboration

Required

Deep expertise in Big Data technologies, data lakes, and modern analytics platforms

Strong hands-on experience with Hadoop ecosystems, Snowflake, Kafka, Spark, and other distributed data platforms

Design, develop, and maintain data pipelines for ingesting, transforming, and delivering large-scale datasets

Manage and optimize data lake architectures to ensure scalability, reliability, and performance

Implement and support Hadoop-based solutions for distributed data processing

Integrate and manage Snowflake for cloud-based data warehousing and analytics

Build and maintain real-time streaming solutions using Kafka

Develop and optimize Spark applications for batch and streaming workloads

Collaborate with data analysts, scientists, and business stakeholders to deliver actionable insights

Ensure data quality, governance, and security across all platforms

Monitor and troubleshoot data pipelines to maintain high availability and performance

Big Data Ecosystem: Hadoop (HDFS, Hive, Pig, MapReduce), Spark, Kafka

Cloud Data Warehousing: Snowflake (preferred), Redshift, BigQuery

Data Lake Management: Experience with large-scale data storage and retrieval

Data Pipelines: ETL/ELT design, orchestration tools (Airflow, NiFi, etc.)

Programming & Scripting: Python, Scala, Java, SQL

Data Analysis: Strong ability to query, analyze, and interpret large datasets

Distributed Systems: Understanding of scalability, fault tolerance, and performance optimization

DevOps & Automation: CI/CD pipelines, containerization (Docker, Kubernetes)

Visualization & BI Tools: Familiarity with Tableau, Power BI, or similar

Preferred

12+ years of experience in data engineering or big data roles

Experience with cloud platforms (AWS, Azure, GCP)

Strong problem-solving and analytical mindset

Excellent communication and collaboration skills

Company

eHub Global Inc

Custom Workforce & HR solutions Services.

Founded in 2017

Dallas, Texas, USA

51-200 employees

https://www.ehub.global

Funding

Current Stage

Growth Stage

Recent News

citybiz

VenHub Expands Retail Innovation with Launch of 24/7 Smart Store in Hollywood

2025-07-12

Thailand Business News

VenHub Launches 24/7 Autonomous Smart Store in Hollywood, Expanding AI Retail in One of LA’s Busiest Districts

2025-07-01

Newsfile

VenHub Launches 24/7 AI-Powered Smart Store at Metro Transit Center at LAX, Leading the Next Era of Autonomous Retail in Travel and Transportation

2025-06-07

Company data provided by crunchbase