Taskify AI · 6 hours ago
Dockerfile Data Validation Engineer (Remote)
Taskify AI is seeking an experienced engineer to design, implement, and maintain data-validation workflows within Docker-based build pipelines. The role involves creating and managing Dockerfiles and collaborating with data engineering, machine learning, and DevOps teams to ensure the quality and compliance of datasets and model artifacts before deployment.
Higher Education
Responsibilities
Develop and optimize Dockerfiles with built-in data-validation steps
Implement LABEL metadata for dataset versions, schemas, and lineage tracking
Create validation scripts using Python or Bash for schema checks, data integrity, and quality control
Integrate validation steps into CI/CD pipelines and enforce fail-on-bad-data checks
Document standards for Dockerfile labeling, validation logic, and data governance
Collaborate with cross-functional teams to ensure reproducibility and reliability of containerized pipelines
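For candidates wondering what a fail-on-bad-data check might look like in practice, here is a minimal sketch in Python. It is illustrative only and does not describe Taskify AI's actual tooling: the schema, the column names, and the function names validate_rows and main are all hypothetical.

```python
import csv
import sys

# Hypothetical schema: column name -> converter that raises on a bad value.
EXPECTED_SCHEMA = {"id": int, "score": float, "label": str}


def validate_rows(lines, schema=EXPECTED_SCHEMA):
    """Return a list of error strings; an empty list means the data passed."""
    reader = csv.DictReader(lines)
    missing = set(schema) - set(reader.fieldnames or [])
    if missing:
        return [f"missing columns: {sorted(missing)}"]
    errors = []
    for line_no, row in enumerate(reader, start=2):  # line 1 is the header
        for column, convert in schema.items():
            try:
                convert(row[column])
            except (TypeError, ValueError):
                errors.append(
                    f"line {line_no}: bad {column!r} value {row[column]!r}"
                )
    return errors


def main(path):
    """Validate one CSV file; return 1 on any error so a build step can fail."""
    with open(path, newline="") as handle:
        errors = validate_rows(handle)
    for error in errors:
        print(error, file=sys.stderr)
    return 1 if errors else 0
```

A build step or CI job could pass the return value of main to sys.exit, so that a non-zero status stops the image build when the dataset does not match the expected schema.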
Qualifications
Required
4+ years of experience in DevOps, software engineering, or data engineering
Strong experience with Docker and Dockerfile creation
Proficiency in Python or Bash scripting for validation purposes
Understanding of data formats, schemas, and validation tools
Familiarity with CI/CD systems and container registries
Strong attention to detail and a process-oriented mindset
Company
Taskify AI
While our platform provides the intelligence, our Staffing & Placement team provides the results.
Funding
Current Stage
Early Stage
Company data provided by Crunchbase