micro1 · 17 hours ago
Data Engineer
Responsibilities
Design and implement data pipelines to efficiently process and manage large datasets.
Develop robust and scalable data storage solutions using databases and data lakes.
Collaborate with stakeholders to understand data requirements and optimize data architecture.
Utilize distributed computing frameworks like Apache Spark or Dask to handle large data processing tasks.
Ensure data quality and governance by implementing best practices in data management.
Preprocess and transform data so that it is ready for machine learning models.
Leverage cloud platforms like AWS, GCP, or Azure for data processing and storage.
Qualifications
Required
Proficiency in Python for data processing and software engineering tasks.
Strong knowledge of SQL for querying and manipulating large datasets.
Experience with distributed computing frameworks such as Apache Spark or Dask.
Expertise in designing scalable data architectures and pipelines.
Familiarity with cloud-based data platforms like AWS, Google Cloud, or Azure.
Understanding of data governance and of practices that ensure data accuracy and quality.
Knowledge of tools/frameworks for ETL processes.
Preferred
Familiarity with big data technologies such as Hadoop, Kafka, Apache Flink, and Airflow.
Experience with machine learning principles and with data transformation for AI model training.
Strong written and verbal communication skills to collaborate effectively in a remote setting.
Company
micro1
AI recruitment engine to hire top global talent
Funding
Current Stage: Growth Stage
Total Funding: $6.6M
2024-06-20 · Seed · $3.3M
2023-07-07 · Pre Seed · $3.3M
Company data provided by Crunchbase