Dexian IT Solutions · 20 hours ago
Machine Learning Data Engineer #939106
Responsibilities
A solid foundation in data engineering, with experience in building and maintaining scalable data pipelines using technologies like Apache Spark, Kafka, SQL, and NoSQL databases
Proficiency in programming languages such as Python, Java, or Scala, and experience with ETL frameworks and data workflow orchestration tools
Hands-on experience with cloud platforms (e.g., AWS, Google Cloud, Azure) and skill in leveraging cloud-based data storage and processing solutions
Familiarity with containerization and orchestration technologies like Docker and Kubernetes, and the ability to deploy and manage data infrastructure in cloud environments
Adept at identifying inefficiencies in data systems and proactively implementing improvements to enhance performance and reliability
Strong commitment to data quality, ensuring that all data processes are accurate, consistent, and reliable
Experience working with synthetic data generation, AI/ML model deployment, or similar projects, and enthusiasm for the unique challenges and opportunities in this area
Familiarity with privacy-preserving technologies and an understanding of the ethical considerations related to synthetic data
Synthetic data and research experience
Experience working as an end-to-end data engineer on a small team
Build the Engine
Building data pipelines to feed the engine
Model deployment experience
Post-deployment experience, including managing drift after the model has been running
Containerization and deployment experience
GDPR is nice to have – the data we deal with is personal
Some understanding of compliance
Data masking
Qualification
Required
Master's degree or higher
A solid foundation in data engineering, with experience in building and maintaining scalable data pipelines using technologies like Apache Spark, Kafka, SQL, and NoSQL databases
Proficiency in programming languages such as Python, Java, or Scala, and experience with ETL frameworks and data workflow orchestration tools
Hands-on experience with cloud platforms (e.g., AWS, Google Cloud, Azure) and skill in leveraging cloud-based data storage and processing solutions
Familiarity with containerization and orchestration technologies like Docker and Kubernetes, and the ability to deploy and manage data infrastructure in cloud environments
Adept at identifying inefficiencies in data systems and proactively implementing improvements to enhance performance and reliability
Strong commitment to data quality, ensuring that all data processes are accurate, consistent, and reliable
Experience working with synthetic data generation, AI/ML model deployment, or similar projects, and enthusiasm for the unique challenges and opportunities in this area
Familiarity with privacy-preserving technologies and an understanding of the ethical considerations related to synthetic data
Synthetic data and research experience
Experience working as an end-to-end data engineer on a small team
Build the Engine
Building data pipelines to feed the engine
Model deployment experience
Post-deployment experience, including managing drift after the model has been running
Containerization and deployment experience
Some understanding of compliance
Data masking
Preferred
GDPR is nice to have – the data we deal with is personal
Company
Dexian IT Solutions
Dexian IT Solutions, a unit of Dexian, is an outcome-driven service and solutions partner that serves enterprises across their Information Technology operations.