Machine Learning Data Engineer (AWS Sagemaker, Python) jobs in United States
cer-icon
Apply on Employer Site
company-logo

The Computer Merchant, LTD (TCM) · 1 month ago

Machine Learning Data Engineer (AWS Sagemaker, Python)

The Computer Merchant, LTD (TCM) is seeking a Machine Learning Data Engineer to build and maintain data pipelines connecting Oracle-based systems to AWS cloud environments. The role involves collaborating with data scientists to ensure data is optimized for machine learning workloads in AWS SageMaker.

Consumer ElectronicsHuman ResourcesInformation TechnologyStaffing Agency
check
H1B Sponsor Likelynote

Responsibilities

Develop and maintain data pipelines to extract, transform, and load data from Oracle databases and other systems into AWS environments (S3, Redshift, Glue, etc.)
Collaborate with data scientists to ensure data is prepared, cleaned, and optimized for SageMaker-based machine learning workloads
Implement and manage data ingestion frameworks, including batch and streaming pipelines
Automate and schedule data workflows using AWS Glue, Step Functions, or Airflow
Develop and maintain data models, schemas, and cataloging processes for discoverability and consistency
Optimize data processes for performance and cost efficiency
Implement data quality checks, validation, and governance standards
Work with DevOps and security teams to comply with client standards

Qualification

AWS data servicesSQLPythonETL/ELT pipelinesData modelingVersion controlData governanceData orchestrationJavaCollaboration

Required

Strong proficiency with SQL and hands-on experience working with Oracle databases
Experience designing and implementing ETL/ELT pipelines and data workflows
Hands-on experience with AWS data services, such as S3, Glue, Redshift, Lambda, and IAM
Proficiency in Python for data engineering (pandas, boto3, pyodbc, etc.)
Solid understanding of data modeling, relational databases, and schema design
Familiarity with version control, CI/CD, and automation practices
Ability to collaborate with data scientists to align data structures with model and analytics requirements
B.S. in Computer Science, MIS or related degree and a minimum of five (5) years of related experience or combination of education, training and experience

Preferred

Experience integrating data for use in AWS SageMaker or other ML platforms
Exposure to MLOps or ML pipeline orchestration
Familiarity with data cataloging and governance tools (AWS Glue Catalog, Lake Formation)
Knowledge of data warehouse design patterns and best practices
Experience with data orchestration tools (e.g., Apache Airflow, Step Functions)
Working knowledge of Java is a plus

Benefits

Medical, dental and vision benefits
Dependent care flexible spending account
401(k) plan
Voluntary life/short term disability/whole life/term life/accident and critical illness coverage
Employee assistance program
Sick leave in accordance with regulation

Company

The Computer Merchant, LTD (TCM)

twittertwittertwitter
company-logo
The Computer Merchant, LTD®, (TCM) is a Veteran-Owned, nationally recognized Information Technology & Software Engineering staffing and services firm.

H1B Sponsorship

The Computer Merchant, LTD (TCM) has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)
2024 (1)

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
John Danieli
President & CEO
linkedin
Company data provided by crunchbase