Data Engineer - Modern Data Platforms jobs in United States
cer-icon
Apply on Employer Site
company-logo

Re:Build Manufacturing · 1 day ago

Data Engineer - Modern Data Platforms

Re:Build Manufacturing is a growing family of industrial and engineering businesses focused on revitalizing US manufacturing. The Data Engineer will utilize modern data technologies to operationalize and expand the enterprise Data Lake, implementing efficient ingestion strategies and ensuring data is structured for accessibility and analysis.

IndustrialIndustrial AutomationIndustrial ManufacturingMachinery ManufacturingManufacturing

Responsibilities

Co-design data interfaces and pipelines in close collaboration with software engineers and technical leads, ensuring alignment with application domain models and product roadmaps
Build and operate batch, streaming, and change data capture (CDC) pipelines from diverse sources (ERP, CRM, Accounting, knowledge repositories, and other enterprise systems) into the data lake
Model curated data within the lake into data warehouse structures (e.g., star schemas, wide tables, semantic layers) optimized for business intelligence (BI), ad-hoc analytics, and key performance indicator (KPI) reporting
Publish certified datasets and policy-aware retrieval assets (tables, document embeddings, vector indexes) to enable analytics, AI, and retrieval-augmented generation (RAG) use cases
Establish robust data observability and quality checks to ensure reliability and consistency
Apply governance, security, and compliance controls across the data lake and warehouse — including role-based access, encryption, auditing, and data retention — in alignment with applicable regulations
Operate the platform reliably by orchestrating jobs, monitoring pipelines, and continuously tuning cost and performance
Work in accordance with The Re:Build Way, demonstrating collaboration, continuous improvement, and technical excellence in every aspect of data engineering

Qualification

Data Lake ArchitectureData Pipeline DesignETL/ELT WorkflowsCloud Data PlatformsPythonSQLData GovernanceData Quality FrameworksApache SparkBusiness Intelligence ModelingAI/ML Use CasesCommunication SkillsProblem-Solving Skills

Required

5 - 8+ years of proven experience building production-grade data systems with a strong understanding of cloud-based data lake architectures and data warehouses
Demonstrated expertise in designing and operating data pipelines (batch, streaming, CDC), including schema evolution, backfills, and performance tuning
Hands-on proficiency with Python and SQL, including experience with distributed processing frameworks (e.g., Apache Spark) and CI/CD for data workflows
Proven ability to design and implement ETL/ELT workflows and data modeling techniques (e.g., star schemas, wide tables, semantic models)
Proficiency with cloud data platforms and services such as AWS, Databricks, and Snowflake, with a focus on scalability and reliability
Familiarity with open table formats (e.g., Iceberg, Delta, Hudi) and business intelligence data modeling
Understanding of data governance, lineage, and data quality frameworks to ensure reliability, accuracy, and compliance
Experience or strong interest in enabling AI/ML use cases (e.g., RAG/search datasets, embeddings, vector indexes)
Bachelor's degree (BA/BS) in Computer Science, Data Science, Mathematics, Analytics, or a related quantitative field (or equivalent experience)
Fluency in written and spoken English
Brings enthusiasm, curiosity, and a consistently positive attitude
Leads by example — offering guidance, mentorship, and accountability on key technical decisions
Skilled at analyzing complex technical challenges and delivering innovative, efficient solutions
Flexible and adaptable to shifting priorities, requirements, and emerging technologies
Communicates clearly and effectively, both in writing and verbally
Exceptionally organized and thrives in a fast-paced, dynamic environment
Strong analytical and problem-solving abilities with sharp attention to detail
Collaborative team player who works effectively across departments and levels of the organization
Must successfully complete a background check and provide reliable professional references

Benefits

Performance-based bonus
Re:Build incentive stock awards
Annual cash bonus
Long term incentive
Competitive, Comprehensive Benefits Plan.

Company

Re:Build Manufacturing

twittertwitter
company-logo
Re:Build Manufacturing is a family of industrial businesses combining cutting-edge enabling technologies, operational superiority and strategic M&A to build America’s next generation industrial company.

Funding

Current Stage
Late Stage
Total Funding
$121.9M
Key Investors
General CatalystUS Department of Energy
2024-08-14Series Unknown· $120M
2024-05-16Grant· $1.9M

Leadership Team

leader-logo
Miles Arnone
Chief Executive Officer
linkedin
leader-logo
Chad Clawson
Chief Operating Officer
linkedin
Company data provided by crunchbase