DataPelago · 2 months ago
Principal Data Processing Engineer
DataPelago is at the forefront of revolutionizing data processing for traditional analytics and cutting-edge GenAI preprocessing. As a Principal Data Processing Engineer, you will lead architectural design and implementation to enhance the performance and reliability of the data processing engine, making a significant impact on a category-defining product.
AnalyticsHardwareSoftware
Responsibilities
Architectural Leadership: Drive the evolution of our parallel and distributed execution engine architecture, with a strong focus on leveraging accelerated computing technologies
End to End Ownership: Lead the execution engine team in the complete lifecycle of design, implementation, and rollout of an enterprise-grade product
Core Development: Individually design, implement, test, and maintain critical components of the data processing execution engine
Innovation and Differentiation: Analyze technology advances from industry and academia to identify opportunities for the engine to enhance technology and product leadership
Collaboration and Mentorship: Partner effectively with engineering, product management, and customer success teams. Guide and mentor engineers on the execution engine team
Continuous Improvement: Foster best practices in design and code reviews, testing, CI/CD, and issue resolution to maintain highest product quality, security, efficiency, & productivity
Qualification
Required
Bachelor's degree in Computer Science, or a related field with 15+ years of relevant experience OR a Master's degree in Computer Science or a related field with 10+ years of relevant experience
10+ years of deep technical experience in developing core components of enterprise-grade database or analytics execution engines designed for large-scale data processing
Proven expertise in developing high-performance parallel implementations of data processing operators and functions on rich data types
Demonstrated experience leading teams of 10+ engineers in the design, development, and successful release of high-performance data processing engines for large production deployments
Exceptional programming skills in C, C++, and Rust
Extensive development experience in Linux environments
Strong analytical and problem-solving skills with a passion for performance optimization
Excellent communication and collaboration skills, with the ability to articulate complex technical concepts to both technical and non-technical audiences
Preferred
Significant experience developing for and understanding the internals of platforms such as Apache Spark, Apache Flink, Apache Doris, Apache Gluten, Velox, Apache DataFusion, or Apache DataFusion Comet is highly preferred
Benefits
Competitive compensation
Stock options
Comprehensive benefits package
Leadership development opportunities
Company
DataPelago
DataPelago is a software development company that offer big data analytics solutions.
H1B Sponsorship
DataPelago has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (2)
2024 (3)
2023 (7)
2022 (1)
2021 (3)
Funding
Current Stage
Growth StageTotal Funding
$55MKey Investors
Eclipse VenturesTaiwania Capital Management Corporation
2024-10-01Series A· $47M
2022-10-06Series A
2021-04-30Seed· $8M
Recent News
Best Data Management Software, Vendors and Data Science Platforms
2025-08-23
solutionsreview.com
2025-08-23
Company data provided by crunchbase