APN Consulting, Inc. · 5 hours ago
Lead Data Engineer
APN Consulting, Inc. is a progressive IT staffing and services company offering innovative business solutions. They are seeking a Lead Data Engineer to enhance security and operational efficiency through advanced analytic techniques and robust ETL pipeline construction.
Consulting · Facility Management · Health Care · Information Technology · IT Management · Service Industry · Staffing Agency
Responsibilities
Design, develop, and maintain scalable ETL pipelines to ensure data quality and availability
Implement monitoring and alerting solutions to ensure data pipeline reliability and performance
Develop and manage deployment pipelines to facilitate continuous integration and delivery of data engineering solutions
Implement data integration solutions to support analytics and reporting needs
Execute the complete analytics lifecycle for problem solving, including:
Algorithm operationalization
Model validation
Model prototyping
Data exploration
Data grooming
Survey varied data sources for analytic relevance, including:
External sources accessed via API
Flat files
Relational databases
Distributed file systems
Interpret, synthesize and communicate results of analyses to effect action and changes within the organization
Collaborate across teams to integrate analytic products with existing production architecture, develop, execute, and evaluate courses of action, and socialize results
Help teach and explain the techniques and tools used to a broad business audience
Qualification
Required
Minimum years of experience: 8-10 years
Ability to read, write, speak and understand English
Strong communication and presentation skills
Expertise in data engineering languages such as Scala (preferred) or Java, with proficiency in Python
Experience with big data tools, particularly Spark
Proficiency in building and managing ETL pipelines
Expert-level quantitative analysis skills, including interpretation of model results, consideration of causality, and treatment of multicollinearity
The ability to work in compiled, high-performance languages (e.g., Scala, Java, C++)
Hands-on experience with and strong understanding of relational databases and SQL, plus familiarity with NoSQL databases
Broad experience with, and a solid theoretical foundation in, the modeling process using a variety of algorithmic techniques, including machine learning and graph/network analytics
Data pre-processing and exploratory data analysis using a variety of techniques
Basic understanding of data architecture, data warehouses, and data marts
Demonstrated ability and desire to continually expand skill set, and learn from and teach others
Preferred
Experience with Linux-based operating systems
Knowledge of other relevant techniques such as text analysis and text mining
Bachelor's degree (or relevant experience) in computer science, mathematics, statistics, physics, operations research, or another quantitatively focused field
Master's degree: 6+ years of work experience
Bachelor's degree: 8+ years of work experience