Arcade · 1 day ago
Lead, Data Operations & Evaluation Engineering
Arcade is building the world’s first AI physical product creation platform, where imagination becomes reality. The Lead, Data Operations & Evaluation Engineering will establish and execute the company-wide AI data strategy and operations, focusing on sourcing and processing diverse data to fuel generative AI models and evaluating those models effectively.
Artificial Intelligence (AI)E-CommerceJewelry
Responsibilities
Develop and Execute AI Data Strategy: Define and lead the comprehensive data strategy for arcade.ai, ensuring the collection, curation, and governance of unique, vast, and diverse datasets (e.g., jewelry specifications, home decor designs, material properties) are optimized for generative AI model development and training
AI Data Acquisition & Management: Define training data requirements in partnership with the AI team and CEO to support model development and research objectives. Lead and implement acquisition strategy including both original data production as well as strategic partnerships. Drive execution of data organization and acquisition plans
Establish Data Operations (DataOps): Build and manage the DataOps function, creating scalable, automated, and quality-controlled pipelines for data ingestion, cleaning, normalization, enrichment, and labeling, specifically around complex, multi-modal product design data
Lead Data Annotation Operations: Design, implement, and oversee highly efficient and quality-focused data annotation pipelines for complex data types (e.g., semantic segmentation, feature labeling, quality scoring) critical for training generative design models. Manage vendor relationships or internal teams dedicated to annotation
Oversee Model Evaluation Data: Collaborate closely with the AI/ML Engineering team to establish and manage the datasets, metrics, and processes used for continuous model evaluation and testing (A/B testing, human-in-the-loop validation, performance benchmarking) to ensure the generated designs meet quality and utility standards
Perform AI Evaluation Engineering: Design and build labeling pipelines, human-in-the-loop evaluations, automated evaluation, and eval metrics. Build labeling tools, craft eval suites (e.g. CLIP similarity, detection accuracy), manage datasets
Data Governance & Quality: Implement best-in-class data governance policies, standards, and procedures to ensure data accuracy, consistency, security, and compliance (e.g., intellectual property rights for makers’ designs)
Cross-Functional Leadership: Partner closely with the company leadership, AI/ML Engineering, Product, and Supply Chain teams to translate model requirements into actionable data collection and preparation strategies, ensuring the data roadmap aligns with product development goals
Infrastructure & Tooling: Oversee the selection, implementation, and management of data infrastructure, storage solutions (e.g., data lakes, feature stores), and specialized tooling required for large-scale AI data management. Establish and maintain robust data infrastructure that ensures secure, scalable storage, organization, and utilization of training data. Define back-end engineering requirements when necessary, working with AI and web engineering leads to ensure seamless integration with product needs
Team Leadership: Recruit, mentor, and lead a high-performing team of data strategists, data engineers, and data quality analysts
Qualification
Required
7+ years of progressive experience in Data Strategy, Data Operations, or Data Science leadership roles, with a significant focus on data for AI/ML models
Proven Experience in Generative AI: Direct experience building and scaling the data pipelines and strategies required to support complex generative AI models (e.g., image, 3D, or text-to-design models) is highly preferred
Domain Expertise: Familiarity with the unique challenges of multi-modal data and datasets related to e-commerce, manufacturing, or product design (e.g., CAD, visual assets, material science data)
Leadership & Execution: Demonstrated ability to transition from high-level strategy to hands-on execution, building robust, production-ready data systems from the ground up
Data Governance Acumen: Deep understanding of data quality principles, metadata management, and data ethics, especially concerning proprietary or intellectual property data
Technical Proficiency: Strong knowledge of modern data technologies (e.g., cloud data warehousing, ETL/ELT tools, distributed systems) and data science programming languages (Python, SQL)
Efficiency Mindset: Seeks continuous improvement in process and efficiency via tools and automation, with demonstrated ability to deliver continuous improvement preferred
Design and Taste Sensibility: An eye for and appreciation of aesthetics. Have an intuitive understanding of what makes for high quality, tasteful, well crafted consumer products
Education: Bachelor's, MBA or Master's degree in Computer Science, Data Science, Engineering, or a related field
Company
Arcade
Arcade is an AI product creation platform that allows users to design and sell custom products.
H1B Sponsorship
Arcade has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)
Funding
Current Stage
Early StageTotal Funding
$47MKey Investors
Laura ChauAshton Kutcher
2025-03-24Series A· $25M
2024-09-20Seed· $5M
2024-09-20Seed· $17M
Recent News
Crunchbase News
2025-11-25
Company data provided by crunchbase