200+ applicants

Company

Dexian IT Solutions · 20 hours ago

Machine Learning Data Engineer #939106

United States

Full-time

Remote

Mid, Senior Level

$120K/yr - $160K/yr

5+ years exp

Consulting · Digital Media

No H1B

Hiring Manager

John Pottebaum

Responsibilities

A solid foundation in data engineering, with experience in building and maintaining scalable data pipelines using technologies like Apache Spark, Kafka, SQL, and NoSQL databases

Proficiency in programming languages such as Python, Java, or Scala, and experience with ETL frameworks and data workflow orchestration tools

Hands-on experience with cloud platforms (e.g., AWS, Google Cloud, Azure) and skill in leveraging cloud-based data storage and processing solutions.

Familiarity with containerization and orchestration technologies like Docker and Kubernetes, and the ability to deploy and manage data infrastructure in cloud environments.

Adept at identifying inefficiencies in data systems and proactively implementing improvements to enhance performance and reliability

Strong commitment to data quality, ensuring that all data processes are accurate, consistent, and reliable.

Experience working with synthetic data generation, AI/ML model deployment, or similar projects, and enthusiasm for the unique challenges and opportunities in this area

Familiarity with privacy-preserving technologies and an understanding of the ethical considerations related to synthetic data

Synthetic data and research experience

Experience working as an end-to-end data engineer on a small team

Build the Engine

Building data pipelines to feed the engine.

Model deployment experience

Post-deployment experience, including managing drift after the model has been running

Containerization and deployment experience

GDPR is nice to have – the data we deal with is personal

Some understanding of compliance.

Data masking

Qualification

Data Engineering, Apache Spark, Kafka, SQL, NoSQL databases, Python, Java, Scala, ETL frameworks, Data workflow orchestration, AWS, Google Cloud, Azure, Docker, Kubernetes, Synthetic data generation, AI/ML model deployment, Data quality assurance, Containerization experience, GDPR knowledge, Data masking, Compliance understanding

Required

Master's degree or higher

A solid foundation in data engineering, with experience in building and maintaining scalable data pipelines using technologies like Apache Spark, Kafka, SQL, and NoSQL databases

Proficiency in programming languages such as Python, Java, or Scala, and experience with ETL frameworks and data workflow orchestration tools

Hands-on experience with cloud platforms (e.g., AWS, Google Cloud, Azure) and skill in leveraging cloud-based data storage and processing solutions

Familiarity with containerization and orchestration technologies like Docker and Kubernetes, and the ability to deploy and manage data infrastructure in cloud environments

Adept at identifying inefficiencies in data systems and proactively implementing improvements to enhance performance and reliability

Strong commitment to data quality, ensuring that all data processes are accurate, consistent, and reliable

Experience working with synthetic data generation, AI/ML model deployment, or similar projects, and enthusiasm for the unique challenges and opportunities in this area

Familiarity with privacy-preserving technologies and an understanding of the ethical considerations related to synthetic data

Synthetic data and research experience

Experience working as an end-to-end data engineer on a small team

Build the Engine

Building data pipelines to feed the engine

Model deployment experience

Post-deployment experience, including managing drift after the model has been running

Containerization and deployment experience

Some understanding of compliance

Data masking

Preferred

GDPR is nice to have – the data we deal with is personal

Company

Dexian IT Solutions

Dexian IT Solutions, a unit of Dexian, is an outcome-driven service and solutions partner that serves enterprises across their Information Technology operations.

Founded in 1994

McLean, Virginia, USA

1,001-5,000 employees