Apply on Employer Site

Ampstek · 19 hours ago

Only USC/GC :: Lead Data Engineer

Englewood, CO

Contract

Onsite

Lead/Staff

Ampstek is seeking a Lead Data Engineer to design, develop, and maintain scalable ETL pipelines. The role involves implementing monitoring solutions, managing deployment pipelines, and collaborating across teams to integrate analytic products with existing architecture.

IT Management

Growth Opportunities

No H1B

U.S. Citizen Only

Hiring Manager

David Wilfred

Responsibilities

Design, develop, and maintain scalable ETL pipelines to ensure data quality and availability

Implement monitoring and alerting solutions to ensure data pipeline reliability and performance

Develop and manage deployment pipelines to facilitate continuous integration and delivery of data engineering solutions

Implement data integration solutions to support analytics and reporting needs

Execute the complete analytics lifecycle for problem solving, including:

Algorithm traditionalization

Model validation

Model prototyping

Data exploration

Data grooming

Survey varied data sources for analytic relevance, including:

External sources accessed via API

Flat files

Relational databases

Distributed file systems

Interpret, synthesize and communicate results of analyses to effect action and changes within the organization

Collaborate across teams to integrate analytic products with existing production architecture, develop, execute, and evaluate courses of action, and socialize results

Help teach and explain techniques and tools used to a broad set of business

Expertise in data engineering languages such as Scala (preferred) or Java, with proficiency in Python

Experience with BigData tools, particularly Spark

Proficiency in building and managing ETL pipelines

Expert-level quantitative analysis skills including interpretation of model results, consideration of causality, treatment of multicollinearity

The ability to work in compiled, high-performance languages (e.g., Scala, Java, C++)

Experience with relational databases

Strong understanding of relational databases and SQL, and familiarity with NoSQL databases

Broad experience and solid theoretical foundation on the modeling process using a

Variety of algorithmic techniques, including Machine Learning, and Graph/Network Analytics

Data pre-processing, exploratory data analysis using a variety of techniques

Basic understanding of data architecture, data warehouse, and data marts

Demonstrated ability and desire to continually expand skill set, and learn from and teach others

Qualification

ETLPythonAWSML OPSData warehousingBigData toolsSQLScalaJavaC++NoSQLGraph AnalyticsData pre-processingExploratory data analysis

Required

ETL

ML OPS

AI-ML

Data warehousing

Python

AWS