Zeektek · 17 hours ago
Databricks Engineer - Python, PySpark, Apache Spark, SQL
Zeektek is seeking a skilled Data Engineer to design, build, and optimize scalable data pipelines and automation for analytics solutions in a modern cloud-based environment. The role involves developing reliable data workflows using Spark, Python, SQL, and Databricks, while collaborating closely with data scientists and analytics teams.
Employment · Human Resources · Recruiting
Responsibilities
Design, develop, and maintain scalable data pipelines using Apache Spark and Databricks
Build and optimize data transformations using Python, PySpark, and SQL
Ensure data quality, reliability, and performance across batch and streaming workloads
Apply strong software engineering best practices, including unit testing, integration testing, and code reviews
Manage source control using GitHub and participate in CI/CD workflows
Collaborate with cross-functional teams to support analytics and ML use cases
Troubleshoot and resolve data pipeline and performance issues
Qualifications
Required
3 or more years of experience in Spark
3 or more years of experience in Python
3 or more years of experience in SQL
5 or more years of experience in software engineering (unit tests, integration tests, GitHub, dependency management, CI/CD)
3 or more years of experience in Databricks
Healthcare Data background
Automation
Preferred
3 or more years of experience with PySpark
Git and MLOps best practices
Company
Zeektek
Zeektek is an IT recruiting and solutions firm.
Funding
Current Stage
Early Stage
Recent News
2023-08-17
Business Journals
2022-12-05
Company data provided by Crunchbase