Data Engineer - Autonomous Vehicle AI Research Infrastructure jobs in United States
cer-icon
Apply on Employer Site
company-logo

ECLARO · 1 day ago

Data Engineer - Autonomous Vehicle AI Research Infrastructure

ECLARO is a leading technology solutions provider looking for a Data Engineer specializing in Autonomous Vehicle AI Research Infrastructure. The role involves designing and maintaining data pipelines for large-scale autonomous vehicle datasets, collaborating with research scientists and machine learning engineers to enhance intelligent driving capabilities.

Staffing & Recruiting
check
H1B Sponsor Likelynote

Responsibilities

Design, implement, and maintain robust data pipelines for ingesting, cleaning, and transforming large-scale autonomous vehicle datasets (camera, LiDAR, radar, GPS, simulation logs)
Develop scalable storage and retrieval systems using AWS services (S3, EC2, SageMaker, Athena, etc.)
Ensure data quality and consistency through automated validation, deduplication, and schema enforcement
Collaborate with ML researchers and engineers to provide efficient access to training data, labels, and metadata
Optimize data preprocessing and batching pipelines to support large-scale training and evaluation workflows
Build tools to manage and audit dataset versions, experiment tracking, and feature reproducibility
Implement and maintain CI / CD workflows for data and pipeline updates, ensuring minimal downtime and reproducible outputs
Monitor data pipeline performance and respond to bottlenecks or outages proactively

Qualification

Data EngineeringPythonAWSSQLData Workflow OrchestrationDistributed ComputingData Quality AssuranceDataset DocumentationCollaboration

Required

B.S. or M.S. in Computer Science, Data Engineering, or a related field
3+ years of experience building production-grade data infrastructure or ML data pipelines
Strong proficiency with Python and SQL, and experience with data workflow orchestration tools (e.g., Airflow, Prefect, Luigi)
Deep experience with AWS services, especially S3 (data storage), EC2 (compute), and SageMaker (model training)
Familiarity with distributed computing frameworks like Spark, Dask, or Ray
Understanding of best practices for dataset documentation, standardization, and reproducibility in research
Experience with autonomous vehicle datasets or robotics sensor data
Familiarity with ML training pipelines and model evaluation workflows
Prior experience collaborating with researchers or applied ML teams in high-throughput environments

Benefits

401k Retirement Savings Plan administered by Merrill Lynch
Commuter Check Pretax Commuter Benefits
Eligibility to purchase Medical, Dental & Vision Insurance through ECLARO

Company

ECLARO

twitter
company-logo
ECLARO is an award-winning professional services firm headquartered in New York City and operating in the U.S., Canada and the Philippines.

H1B Sponsorship

ECLARO has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)
2024 (1)
2023 (1)
2022 (2)
2020 (1)

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Nicholas Butcher
QA CTO Label Specialist
linkedin
leader-logo
Dan Broderick
Chief Delivery Officer
linkedin
Company data provided by crunchbase