Backflip · 2 months ago
Staff Machine Learning Data Engineer
Backflip is building a foundation model for mechanical design, aiming to unify engineering knowledge into an intelligent design environment. The Staff Machine Learning Data Engineer will lead the development of data pipelines that support this model, collaborating with Machine Learning Engineers to enhance model performance through data-driven experiments.
FinTech · iOS · PropTech · Real Estate Investment
Responsibilities
Architect and own Backflip’s ML data pipeline, from ingestion to processing to evaluation
Define data strategy: establish best practices for data augmentation, filtering, and sampling at scale
Design scalable data systems for multimodal training (text, geometry, CAD, and more)
Develop and automate data collection, curation, and validation workflows
Collaborate with MLEs to design and execute experiments that measure and improve model performance
Build tools and metrics for dataset analysis, monitoring, and quality assurance
Contribute to model development through insights grounded in data, shaping what, how, and when we train
Qualifications
Required
You've built and maintained ML data pipelines at scale, ideally for foundation or generative models, that shipped into production in the real world
You have deep experience with data engineering for ML, including distributed systems, data extraction, transformation, and loading, and large-scale data processing (e.g. PySpark, Beam, Ray, or similar)
You're fluent in Python and experienced with ML frameworks and data formats (Parquet, TFRecord, HuggingFace datasets, etc.)
You've developed data augmentation, sampling, or curation strategies that improved model performance
You think like both an engineer and an experimentalist: curious, analytical, and grounded in evidence
You collaborate well across AI development, infra, and product, and enjoy building the data systems that make great models possible
You care deeply about data quality, reproducibility, and scalability
You're excited to help shape the future of AI for physical design
Preferred
You are comfortable working with a variety of complex data formats, e.g. for 3D geometry kernels or rendering engines
You have an interest in math, geometry, topology, rendering, or computational geometry
You've worked in 3D printing, CAD, or computer graphics domains
Company
Backflip
Backflip is a proptech fintech firm that enables real estate entrepreneurs to enhance local communities by revitalizing housing.
Funding
Current Stage: Growth Stage
Total Funding: $344.26M
Key Investors: Performance Trust Capital Partners, ECMC, FirstMark
2025-12-04 · Debt Financing · $100M
2025-11-14 · Series Unknown · $10M
2024-10-15 · Series A · $30M
Company data provided by Crunchbase