Research Engineer - Data Infrastructure/ML jobs in United States
cer-icon
Apply on Employer Site
company-logo

Third Dimension AI · 2 months ago

Research Engineer - Data Infrastructure/ML

Third Dimension AI is building SuperSim, a new kind of simulator that enables fast, cost-effective, and photorealistic 3D simulations using AI. They are seeking a Data Infrastructure / ML Engineer to design scalable pipelines and develop ML workflows for their 3D AI systems, ultimately enhancing how robots and autonomous systems are tested in real-world scenarios.

Generative AISoftwareWeb Development

Responsibilities

Build and maintain high-performance data pipelines to ingest, transform, and version multi-modal datasets (3D, video, sensor)
Design and optimize distributed training and data-processing infrastructure - across cloud and containerized environments (Kubernetes, Ray, Dask, EKS, Buildkite)
Collaborate with researchers to productionize ML models (PyTorch), from prototype to deployment
Develop tools and APIs that make data discoverable, reusable, and reproducible
Monitor and improve data quality, lineage, and performance across the ML lifecycle

Qualification

PythonDistributed data processingCloud infrastructureML workflowsDataset versioningSimulation exposureDeep learning frameworks3D data formatsOpen-source contributionsFrontend/UI experience

Required

Build and maintain high-performance data pipelines to ingest, transform, and version multi-modal datasets (3D, video, sensor)
Design and optimize distributed training and data-processing infrastructure - across cloud and containerized environments (Kubernetes, Ray, Dask, EKS, Buildkite)
Collaborate with researchers to productionize ML models (PyTorch), from prototype to deployment
Develop tools and APIs that make data discoverable, reusable, and reproducible
Monitor and improve data quality, lineage, and performance across the ML lifecycle

Preferred

Strong programming background in Python
Proficiency in distributed data-processing technologies (e.g., Ray, Apache Spark, Flyte, Dask)
Hands-on experience with cloud infrastructure (AWS, GCP, or Azure), Kubernetes, and distributed training frameworks (Ray, RLLib, PyTorch DDP, or Horovod)
Knowledge of dataset versioning, experiment tracking, and reproducibility tools (DVC, MLflow, etc.)
Exposure to simulation, robotics, or autonomy testing pipelines
Experience with deep learning frameworks and ML workflows (PyTorch)
Familiarity with 3D or robotics data formats (point clouds, meshes, radiance fields, LIDAR)
Contributions to open-source ML/infra projects or publications in top-tier AI/ML/CV venues
Experience with frontend / UI work

Benefits

Competitive salary & stock options – Everyone is an owner and shares in our success.
Pension / retirement plan – Company contributions to support your long-term financial wellbeing.
Health & wellness – Comprehensive health, dental, and vision insurance (with regional equivalents for employees outside the UK).
Flexible time off – Generous holiday allowance plus local public holidays.
Hybrid-first culture – State-of-the-art London workspace with the flexibility to work remotely.
Workspace support – Latest hardware and home office equipment to set you up for success.
Learning & development – Budget and time for conferences, courses, and workshops to keep you at the frontier of ML and 3D AI.
Community & connection – Team offsites, events, and opportunities to connect with global colleagues.

Company

Third Dimension AI

twittertwitter
company-logo
Third Dimension offers immersive quality, rendering engine ready content that can be used by experts in a variety of sectors.

Funding

Current Stage
Early Stage
Total Funding
$7M
Key Investors
Felicis
2024-10-08Seed· $7M
Company data provided by crunchbase