Machine Learning Data Engineer #939106 @ Dexian IT Solutions | Jobright.ai
JOBSarrow
RecommendedLiked
0
Applied
0
External
0
Machine Learning Data Engineer #939106 jobs in United States
200+ applicants
company-logo

Dexian IT Solutions · 22 hours ago

Machine Learning Data Engineer #939106

ftfMaximize your interview chances
ConsultingDigital Media
badNo H1Bnote
Hiring Manager
John Pottebaum
linkedin

Insider Connection @Dexian IT Solutions

Discover valuable connections within the company who might provide insights and potential referrals.
Get 3x more responses when you reach out via email instead of LinkedIn.

Responsibilities

A solid foundation in data engineering, with experience in building and maintaining scalable data pipelines using technologies like Apache Spark, Kafka, SQL, and NoSQL databases
Proficiency in programming languages such as Python, Java, or Scala, and have experience with ETL frameworks and data workflow orchestration tools
Hands-on experience with cloud platforms (e.g., AWS, Google Cloud, Azure) and are skilled in leveraging cloud-based data storage and processing solutions.
Familiarity with containerization and orchestration technologies like Docker and Kubernetes, and can deploy and manage data infrastructure in cloud environments.
Adept at identifying inefficiencies in data systems and can proactively implement improvements to enhance performance and reliability
Strong commitment to data quality, ensuring that all data processes are accurate, consistent, and reliable.
Experience working with synthetic data generation, AI/ML model deployment, or similar projects, and are excited by the unique challenges and opportunities in this area
Familiarity with privacy-preserving technologies and have an understanding of the ethical considerations related to synthetic data
Synthetic data and research experience
Data engineer who has worked as an end-to-end data engineer on a small team
Build the Engine
Building data pipelines to feed the engine.
Model deployment experience
Post-model experience deploying, and managing drift after the model has been running
Containerization and deployment experience
GDPR is nice to have – the data we deal with is personal
Some understanding of compliance.
Data masking

Qualification

Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.

Data EngineeringApache SparkKafkaSQLNoSQL databasesPythonJavaScalaETL frameworksData workflow orchestrationAWSGoogle CloudAzureDockerKubernetesSynthetic data generationAI/ML model deploymentData quality assuranceContainerization experienceGDPR knowledgeData maskingCompliance understanding

Required

MASTERS DEGREE OR HIGHER
A solid foundation in data engineering, with experience in building and maintaining scalable data pipelines using technologies like Apache Spark, Kafka, SQL, and NoSQL databases
Proficiency in programming languages such as Python, Java, or Scala, and have experience with ETL frameworks and data workflow orchestration tools
Hands-on experience with cloud platforms (e.g., AWS, Google Cloud, Azure) and are skilled in leveraging cloud-based data storage and processing solutions
Familiarity with containerization and orchestration technologies like Docker and Kubernetes, and can deploy and manage data infrastructure in cloud environments
Adept at identifying inefficiencies in data systems and can proactively implement improvements to enhance performance and reliability
Strong commitment to data quality, ensuring that all data processes are accurate, consistent, and reliable
Experience working with synthetic data generation, AI/ML model deployment, or similar projects, and are excited by the unique challenges and opportunities in this area
Familiarity with privacy-preserving technologies and have an understanding of the ethical considerations related to synthetic data
Synthetic data and research experience
Data engineer who has worked as an end-to-end data engineer on a small team
Build the Engine
Building data pipelines to feed the engine
Model deployment experience
Post-model experience deploying, and managing drift after the model has been running
Containerization and deployment experience
Some understanding of compliance
Data masking

Preferred

GDPR is nice to have – the data we deal with is personal

Company

Dexian IT Solutions

twittertwitter
company-logo
Dexian IT Solutions, a unit of Dexian, is an outcome-driven service and solutions partner that serves enterprises across their Information Technology operations.

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Heidi Lavulo
Wells Fargo Advisors Executive Office- ECMO
linkedin
leader-logo
Lydia Wilson (she, her, hers)
Chief People Officer
linkedin
Company data provided by crunchbase
logo

Orion

Your AI Copilot