Enformion ยท 4 hours ago
Senior Data Engineer - ENF
Enformion is a dynamic and innovative data and analytics company that assists digital marketplaces in fraud prevention and risk management. They are seeking a Senior Data Engineer to build a modern data processing platform using Spark, EMR, and various databases, while improving overall data quality and infrastructure scalability.
AnalyticsInformation Technology
Responsibilities
Implement and maintain big data platform and infrastructure
Develop, optimize and tune MySQL stored procedures, scripts, and indexes
Develop Hive schemas and scripts, Spark Jobs using pyspark and Scala and UDFs in Java
Design, develop and maintain automated, complex, and efficient ETL processes to do batch records-matching of multiple large-scale datasets, including supporting documentation
Develop and maintains pipelines using Airflow or any other tools to monitor, debug, and analyze data pipelines
Troubleshoot Hadoop cluster and query issues, evaluate query plans, and optimize schemas and queries
Strong interpersonal skills to resolve problems in a professional manner, lead working groups, and negotiate consensus
Qualification
Required
BS, MS, or PhD in Computer Science or related field
5+ years minimum experience in language such as Java, Scala, PySpark, Perl, Shell Scripting and Python
Working knowledge of the Hadoop ecosystem applications (MapReduce, YARN, Pig, Hbase, Hive, Spark and more!)
Strong Experience working with data pipelines in multi-terabyte data warehouses. Experience in dealing with performance and scalability issues
Strong SQL (MySQL, Hive, etc.) and No-SQL (MongoDB, Hbase, etc.) skills, including writing complex queries and performance tuning
Knowledge of data modeling, partitioning, indexing, and architectural database design
Experience using Source Code and Version Control systems like GIT etc
Experience on continuous build and test process using tools such as GitLab, SBT, Postman, etc
Experience with Search Engines, Name/Address Matching, or Linux text processing
Preferred
Knowledge of cluster configuration, Hadoop administration and performance tuning are a huge plus
Distributed computing principles and experience in big data technologies including performance tuning
Machine Learning
Company
Enformion
Enformion is designed to meet the advanced data and research needs of business and government professionals.
Funding
Current Stage
Growth StageRecent News
MarTech Breakthrough
2025-08-14
Company data provided by crunchbase