Collabera · 16 hours ago
Data Engineer
Collabera is in the Banking Industry, and they are seeking a Lead Data Engineer to support the development of customer-centric strategies. The role involves managing cloud resources, data ingestion and processing, and ensuring data quality for automated customer communications.
Responsibilities
Create and manage cloud resources in AWS
Data ingestion from different data sources which exposes data using different technologies, such as: RDBMS, REST HTTP API, flat files, Streams, and Time series data based on various proprietary systems. Implement data ingestion and processing with the help of Big Data technologies
Data processing/transformation using various technologies such as Spark and Cloud Services. You will need to understand your part of business logic and implement it using the language supported by the base data platform
Develop automated data quality check to make sure right data enters the platform and verifying the results of the calculations
Develop an infrastructure to collect, transform, combine and publish/distribute customer data
Define process improvement opportunities to optimize data collection, insights and displays
Ensure data and results are accessible, scalable, efficient, accurate, complete and flexible
Identify and interpret trends and patterns from complex data sets
Construct a framework utilizing data visualization tools and techniques to present consolidated analytical and actionable results to relevant stakeholders
Key participant in regular Scrum ceremonies with the agile teams
Proficient at developing queries, writing reports and presenting findings
Mentor junior members and bring best industry practices
Qualification
Required
5-7+ years' experience as data engineer in consumer finance or equivalent industry (consumer loans, collections, servicing, optional product, and insurance sales)
Strong background in math, statistics, computer science, data science or related discipline
Advanced knowledge one of language: Java, Scala, Python, C#
Production experience with: HDFS, YARN, Hive, Spark, Kafka, Oozie / Airflow, (AWS), Docker / Kubernetes, Snowflake
Proficient with data mining/programming tools (e.g. SAS, SQL, R, Python)
Proficient with database technologies (e.g. PostgreSQL, Redshift, Snowflake. and Greenplum)
Proficient with data visualization (e.g. Tableau, Looker, MicroStrategy)
Comfortable learning about and deploying new technologies and tools
Organizational skills and the ability to handle multiple projects and priorities simultaneously and meet established deadlines
Good written and oral communication skills and ability to present results to non-technical audiences
Knowledge of business intelligence and analytical tools, technologies and techniques
Preferred
AWS certification
Spark Streaming
Kafka Streaming / Kafka Connect
ELK Stack
Cassandra / MongoDB
CI/CD: Jenkins, GitLab, Jira, Confluence other related tools
Benefits
Medical insurance
Dental insurance
Vision insurance
401(k) retirement plan
Life insurance
Long-term disability insurance
Short-term disability insurance
Paid parking/public transportation
Paid time off
Paid sick and safe time
Hours of paid vacation time
Weeks of paid parental leave
Paid holidays annually - as applicable.
Company
Collabera
Collabera is an end-to-end information technology services and solutions provider helping clients align their business and IT strategies.
H1B Sponsorship
Collabera has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (141)
2024 (93)
2023 (120)
2022 (186)
2021 (180)
2020 (146)
Funding
Current Stage
Late StageTotal Funding
$30M2006-05-04Series Unknown· $30M
Recent News
2024-04-09
2023-01-20
2022-04-19
Company data provided by crunchbase