Steampunk, Inc. · 4 months ago
Data Engineer - Databricks
Steampunk, Inc. is focused on creating and executing comprehensive data strategies for their clients. They are seeking a seasoned Data Engineer to develop enterprise-grade data platforms and pipelines in Databricks, while working with a team to solve complex data problems.
ConsultingInformation Technology
Responsibilities
Lead and architect migrations of data using Databricks with focus on performance, reliability, and scalability
Assess and understand ETL jobs, workflows, data marts, BI tools, and reports
Address technical inquiries concerning customization, integration, enterprise architecture and general feature/functionality of data products
Experience working with database/data warehouse/data mart solutions in cloud (Preferably AWS. Alternatively Azure, GCP)
Key must have skill sets – Databricks, SQL, PySpark / Python, AWS
Support an Agile software development lifecycle
You will contribute to the growth of our AI & Data Exploitation Practice!
Qualification
Required
Ability to hold a position of public trust with the US government
2-4 years industry experience coding commercial software and a passion for solving complex problems
2-4 years direct experience in Data Engineering with experience in tools such as:
Big data tools: Databricks, Apache Spark, Delta Lake, etc
Relational SQL (Preferably T-SQL. Alternatively pgSQL, MySQL)
Data pipeline and workflow management tools: Databricks Workflows, Airflow, Step Functions, etc
AWS cloud services: Databricks on AWS, S3, EC2, RDS (or Azure equivalents)
Object-oriented/object function scripting languages: PySpark/Python, Java, C++, Scala, etc
Experience working with Data Lake house architecture and Delta Lake/Apache Iceberg
Advanced working SQL knowledge and experience working with relational databases, query authoring and optimization (SQL) as well as working familiarity with a variety of databases
Experience manipulating, processing, and extracting value from large, disconnected datasets
Ability to inspect existing data pipelines, discern their purpose and functionality, and re-implement them efficiently in Databricks
Experience manipulating structured and unstructured data
Experience architecting data systems (transactional and warehouses)
Experience the SDLC, CI/CD, and operating in dev/test/prod environments
Commitment to data governance
Experience working in an Agile environment
Experience supporting project teams of developers and data scientists who build web-based interfaces, dashboards, reports, and analytics/machine learning models
Preferred
Experience with data cataloging tools such as Informatica EDC, Unity Catalog, Collibra, Alation, Purview, or DataZone is a plus
Company
Steampunk, Inc.
Steampunk is anchored by a startup culture with a customer-centered delivery approach, we put our Federal government clients in the center of everything we design, develop, and deliver to drive high-quality mission impacts and user experiences at speed.
Funding
Current Stage
Growth StageTotal Funding
unknownKey Investors
AcceliCITY powered by Leading Cities
2024-07-31Non Equity Assistance
Recent News
Washington Technology
2025-10-01
2024-05-21
Company data provided by crunchbase