Data Engineer — Analytics Infrastructure (Foundational Hire) jobs in United States
cer-icon
Apply on Employer Site
company-logo

Vast.ai · 1 day ago

Data Engineer — Analytics Infrastructure (Foundational Hire)

Vast.ai is a company dedicated to democratizing and decentralizing AI computing. They are seeking a Data Engineer to build and own their data platform, focusing on ingestion, modeling, governance, and self-serve analytics for various departments.

AI InfrastructureArtificial Intelligence (AI)Cloud ComputingGPUMachine Learning

Responsibilities

Own the data pipeline: design, build, and operate batch/streaming ingestion from product, billing, CRM, support, and marketing/ad platforms into a central warehouse
Model the data: create clean, well‑documented staging and business marts (dimensional/star schemas) that map to the needs of Marketing, Sales, Accounting/Finance, and Operations
Enable: publish certified datasets with row‑/column‑level security, manage refresh SLAs, and make it easy for teams to self‑serve
Collaborate cross‑functionally: intake requirements, translate them into data contracts and models, and partner with Engineering on event/telemetry capture
Document & scale: maintain clear docs, lineage, and a pragmatic data catalog so others can discover and trust the data

Qualification

Data EngineeringAWSSQLPythonData ModelingETL/ELTQuickSightCollaborationCommunicationDocumentation

Required

3+ years (typically 3–6) in a Data Engineering role building production ELT/ETL on a cloud platform (AWS strongly preferred)
Expert SQL and solid Python for data processing/automation
Proven experience designing data models (staging, marts, star schemas) and standing up a warehouse/lakehouse
Orchestration, scheduling, and operational ownership (SLAs, alerting, runbooks)
Experience enabling a BI layer (ideally QuickSight) with secure, governed datasets
Strong collaboration and communication; able to gather requirements from non‑technical stakeholders and translate to data contracts

Preferred

Marketing/Sales/RevOps data (CRM, ads, attribution), Accounting/Finance integrations, or product telemetry/event pipelines
Stream processing (Kafka/Kinesis), CDC, or near‑real‑time ingestion
Data privacy/security best practices (e.g., CPRA), partitioning/performance tuning, and cost management on AWS

Benefits

Comprehensive health, dental, vision, and life insurance
401(k) with company match
Meaningful early-stage equity
Onsite meals, snacks, and close collaboration with founders/tech leaders
Ambitious, fast-paced startup culture where initiative is rewarded

Company

Vast.ai

twittertwittertwitter
company-logo
Global GPU rental platform for saving 5-6X on GPU compute using one simple interface.

Funding

Current Stage
Early Stage
Total Funding
$4M
Key Investors
DRW Venture Capital
2024-07-12Seed· $4M

Leadership Team

leader-logo
Travis Cannell
CMO
linkedin
Company data provided by crunchbase