Summit Tech Partners · 10 hours ago
Data Engineer
Summit Tech Partners is seeking a Data Engineer to design, build, and maintain data pipelines and platforms that support large-scale analytics and data-driven applications. The role involves working with modern data technologies and cloud platforms to ensure reliable and scalable data delivery across the organization.
Responsibilities
Develop and maintain scalable ETL/ELT pipelines using SQL, Python, and modern data frameworks
Work with distributed data processing tools such as Spark, Hive, and Airflow
Build and optimize data workflows across structured and unstructured data sources
Collaborate with engineering and analytics teams to design robust data architectures
Implement data quality checks, validation processes, and governance standards
Integrate streaming and messaging technologies into data pipelines
Support cloud-based data solutions and contribute to platform modernization efforts
Optimize performance for large-scale datasets and high-volume processing
Qualifications
Required
Strong proficiency in SQL
Experience with Python for data processing
Understanding of ETL concepts and data transformation workflows
Preferred
Familiarity with the Hadoop ecosystem and distributed data processing
Experience with Kafka or other streaming platforms
Hands-on experience with Spark, Hive, or Airflow
Knowledge of cloud data platforms (AWS, GCP, or Azure)
Experience working with NoSQL databases
Understanding of big data architecture and scalable system design
Experience with performance tuning for large-scale data systems
Background in data governance, security practices, and compliance
Exposure to AI-driven data processing or intelligent automation