SIGN IN
ML/AI Data Engineer (Remote) jobs in United States
cer-icon
Apply on Employer Site
company-logo

FEI Systems · 1 day ago

ML/AI Data Engineer (Remote)

FEI Systems is dedicated to creating innovative technology solutions for health and human services. They are seeking a Data Engineer to support enterprise Machine Learning and Artificial Intelligence initiatives by building, operating, and maintaining data pipelines and ensuring high-quality data for ML/AI solutions.
Information ServicesInformation Technology
check
H1B Sponsor Likelynote

Responsibilities

Design, build, and maintain scalable data pipelines to support ML/AI workloads
Ingest data from multiple sources into the Snowflake data lake using batch and streaming patterns
Develop and maintain ELT pipelines leveraging Snowflake-native capabilities
Ensure pipelines are reliable, performant, and production-ready
Perform data transformations directly in Snowflake using SQL and Snowflake features
Design and optimize schemas, tables, views, and materialized views for ML/AI consumption
Implement transformation logic supporting analytics, feature engineering, and model training
Optimize Snowflake usage for performance and cost efficiency
Implement data quality checks, validation rules, and monitoring within pipelines and Snowflake
Support data governance initiatives including metadata management, lineage, and access controls
Ensure datasets adhere to enterprise standards for security, privacy, and compliance
Identify, troubleshoot, and remediate data quality issues impacting ML/AI workflows
Perform data cleansing, normalization, and enrichment to support ML model development
Design and implement feature engineering pipelines, including feature aggregation and transformation
Ensure consistency, reuse, and versioning of features across models and use cases
Collaborate with ML engineers and data scientists to operationalize features from Snowflake into training pipelines
Support and execute model training workflows, including dataset preparation and refreshes
Automate data preparation steps for experimentation, retraining, and scheduled runs
Ensure training datasets and features are reproducible, traceable, and auditable
Integrate data pipelines and Snowflake transformations into CI/CD workflows
Support version control, testing, and deployment of data assets
Monitor pipeline health, data freshness, and downstream impacts on ML/AI systems
Partner with platform, ML, and DevOps teams to improve operational maturity

Qualification

PythonSQLSnowflakeAWSData EngineeringMLOpsCI/CDData QualityFeature EngineeringData Governance

Required

Strong proficiency in Python for data processing and pipeline development
Advanced SQL skills, with hands-on experience transforming data in Snowflake
Experience designing ELT pipelines using Snowflake as the central data lake
Understanding of Snowflake performance tuning and cost optimization concepts
Experience working within the AWS ecosystem, including services such as: S3, Glue, Athena, Lambda, Step Functions, Kinesis, Snowpipe or MSK (preferred)
Experience integrating Snowflake with AWS-based ingestion and processing pipelines
Exposure to Amazon SageMaker data preparation and training workflows
Understanding of data requirements for machine learning and AI workloads
Experience preparing training datasets and features from enterprise data lakes
Familiarity with reproducibility, dataset versioning, and data lineage concepts
Experience operating within a structured SDLC
Familiarity with CI/CD pipelines for data and ML workflows
Understanding of API-based and event-driven data integration patterns
Experience supporting distributed data processing environments
Bachelor's degree in Computer Science, Machine Learning, Artificial Intelligence, or related field

Preferred

Experience supporting ML/AI platforms or products in production
Familiarity with feature stores and ML data management tools
Exposure to data observability, quality, and monitoring solutions
Experience working in governance-heavy or regulated environments
Snowflake or AWS certifications (preferred, not required)
Experience leveraging ML/AI in a highly regulated healthcare environment (Understanding of HIPAA, 42CFR Part 2 and other privacy regulations)

Benefits

Full company benefits

Company

FEI Systems

twittertwittertwitter
company-logo
FEi is a leading information technology, services, and analysis

H1B Sponsorship

FEI Systems has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (24)
2024 (20)
2023 (24)
2022 (36)
2021 (30)
2020 (31)

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Craig Steffen
Chief Executive Officer
linkedin
leader-logo
Jiao Gu
President & CEO
linkedin
Company data provided by crunchbase