Verdigris · 2 months ago

Lead Data Engineer

Verdigris is on a mission to sustain and enrich human life through responsive energy intelligence. The Lead Data Engineer will design and implement a modern data architecture, managing data flows and ensuring data quality to support climate-focused outcomes at scale.

Artificial Intelligence (AI) · Big Data · Energy · Internet of Things · SaaS
Growth Opportunities
H1B Sponsor Likely

Responsibilities

Collaborate with Product Management, understand use cases and personas, and engineer the product to support a strong user experience
Own schema design and data modeling for energy metering and building management system (BMS) data
Architect and maintain cost-effective, performant next-generation data storage (e.g., ClickHouse, StarTree)
Lead data architecture decisions, including evaluating and integrating tools in our modern data stack
Build and manage robust, scalable ETL/ELT pipelines to ingest, transform, and serve data
Ensure performance and efficiency of analytical queries across large datasets
Develop and enforce data quality, validation, and governance standards
Support real-time IoT analytics and streaming pipelines
Own BI tooling (e.g., Superset, Looker, Tableau)
Contribute to building internal data tools for engineers and analysts
Collaborate with AI/ML teams to support model training and inference pipelines
Work with web and application teams to ensure real-time and batch data access needs are met
Manage team projects and coordinate with other technical leads
Mentor junior engineers and contribute to technical hiring

Qualifications

Data engineering · OLAP schema design · Columnar databases · SQL proficiency · Python for data pipelines · ETL workflows · AWS Cloud · IoT data systems · Data quality standards · Data observability · Team management · Mentoring · Soft skills

Required

Align with core working hours, 10:00 AM to 5:00 PM PST, from the Pacific, Mountain, or Central time zone
5+ years of experience in data engineering with large-scale, high-throughput systems
Proven experience designing dimensional models and OLAP schemas (fact/dimension tables)
Deep understanding of columnar stores and database internals (e.g., ClickHouse, Druid, StarTree, Pinot)
Strong SQL skills and proficiency with Python for data pipelines
Experience handling updates/inserts/type-2 dimensions for time-series or large-scale event stores

Preferred

Experience with BMS/HVAC or energy data
Experience using time-series and energy data for diagnostics and efficiency
Experience with IoT or sensor data systems
Experience working in AWS Cloud
Experience with Postgres
Proficiency in orchestrating ETL workflows (e.g., Dagster, Airflow, AWS Step Functions)
Familiarity with stream processing tools (e.g., Kafka, Flink, Spark Streaming)
Exposure to machine learning feature stores or MLOps tooling
Experience with data observability and data cataloging tools
Experience managing a team or direct reports

Company

Verdigris

Verdigris provides AI-driven, real-time electrical intelligence for the most power-intensive and complex data center operations.

H1B Sponsorship

Verdigris has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
[Chart: Distribution of Different Job Fields Receiving Sponsorship]
Trends of Total Sponsorships
2023 (1)
2022 (1)
2021 (1)
2020 (2)

Funding

Current Stage: Growth Stage
Total Funding: $51.53M
Key Investors: Startup Island TAIWAN, Oyster Ventures, Jabil
2023-08-16 · Series Unknown · $10M
2020-10-01 · Series Unknown
2020-04-09 · Debt Financing · $6.69M

Leadership Team

Mark Chung
Co-Founder, CEO
Jonathan Chu
Co-Founder, CTO
Company data provided by Crunchbase