Gemini · 2 months ago
Principal, Data Engineer
Gemini is a global crypto and Web3 platform, offering a wide range of crypto products and services. The Principal Data Engineer will set the technical direction for data modeling, processing, and delivery, partnering with various teams to ensure data architecture is scalable and reliable.
CryptocurrencyFinanceFinancial ServicesFinTechWeb3
Responsibilities
Define and drive the long-term vision for data architecture, modeling, and transformation at Gemini
Establish standards for data reliability, observability, and quality across all pipelines and data products using languages and frameworks such as Python, SQL, Spark, Flink, Beam, or equivalents
Partner with Staff and Senior Data Engineers, Platform Engineers, and Analytics Engineers to unify how data is produced, stored, and consumed
Lead large-scale design initiatives that span multiple teams and systems, ensuring maintainability, performance, and security
Partner with data scientists, ML engineers, analysts, and product teams to understand data requirements, define SLAs, and deliver coherent data products that others can self-serve
Establish data quality, validation, observability, and monitoring frameworks (data auditing, alerting, anomaly detection, data lineage)
Investigate and resolve complex production issues: root cause analysis, performance bottlenecks, data integrity, fault tolerance
Mentor and guide more junior and mid-level data engineers: lead code reviews, design reviews, and best-practice evangelism
Help recruit and onboard new talent, shaping the future of Gemini’s data engineering discipline
Stay up to date on new tools, technologies, and patterns in the data and cloud space, bringing proposals and proof-of-concepts when appropriate
Document data flows, data dictionaries, architecture patterns, and operational runbooks
Qualification
Required
10+ years of experience in data engineering (or similar) roles
Strong experience in ETL/ELT pipeline design, implementation, and optimization
Deep expertise in Python and SQL writing production-quality, maintainable, testable code
Experience with large-scale data warehouses (e.g. Databricks, BigQuery, Snowflake)
Solid grounding in software engineering fundamentals, data structures, and systems thinking
Hands-on experience in data modeling (dimensional modeling, normalization, schema design)
Experience building systems with real-time or streaming data (e.g. Kafka, Kinesis, Flink, Spark Streaming), and familiarity with CDC frameworks
Experience with orchestration / workflow frameworks (e.g. Airflow)
Familiarity with data governance, lineage, metadata, cataloging, and data quality practices
Strong cross-functional communication skills; ability to translate between technical and non-technical stakeholders
Proven experience in recruiting, mentoring, leading design discussions, and influencing data-engineering best practices across teams
Preferred
Experience with crypto, financial services, trading, markets, or exchange systems
Experience with blockchain, crypto, Web3 data — e.g. blocks, transactions, contract calls, token transfers, UTXO/account models, on-chain indexing, chain APIs, etc
Experience with infrastructure as code, containerization, and CI/CD pipelines
Hands-on experience managing and optimizing Databricks on AWS
Benefits
Competitive starting salary
A discretionary annual bonus
Long-term incentive in the form of a new hire equity grant
Comprehensive health plans
401K with company matching
Paid Parental Leave
Flexible time off
Company
Gemini
Gemini is a licensed digital asset exchange and custodian built for both individuals and institutions.
H1B Sponsorship
Gemini has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (2)
2024 (1)
Funding
Current Stage
Public CompanyTotal Funding
$499.9MKey Investors
RippleDraper DragonMorgan Creek Digital
2025-09-12IPO
2025-07-10Debt Financing· $75M
2022-06-20Secondary Market· $1M
Recent News
Analytics Insight: Latest AI, Crypto, Tech News & Analysis
2026-01-11
2026-01-07
Company data provided by crunchbase