Principal Data Processing Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

DataPelago · 2 months ago

Principal Data Processing Engineer

DataPelago is at the forefront of revolutionizing data processing for traditional analytics and cutting-edge GenAI preprocessing. As a Principal Data Processing Engineer, you will lead architectural design and implementation to enhance the performance and reliability of the data processing engine, making a significant impact on a category-defining product.

AnalyticsHardwareSoftware
check
H1B Sponsor Likelynote

Responsibilities

Architectural Leadership: Drive the evolution of our parallel and distributed execution engine architecture, with a strong focus on leveraging accelerated computing technologies
End to End Ownership: Lead the execution engine team in the complete lifecycle of design, implementation, and rollout of an enterprise-grade product
Core Development: Individually design, implement, test, and maintain critical components of the data processing execution engine
Innovation and Differentiation: Analyze technology advances from industry and academia to identify opportunities for the engine to enhance technology and product leadership
Collaboration and Mentorship: Partner effectively with engineering, product management, and customer success teams. Guide and mentor engineers on the execution engine team
Continuous Improvement: Foster best practices in design and code reviews, testing, CI/CD, and issue resolution to maintain highest product quality, security, efficiency, & productivity

Qualification

Apache SparkParallel ProcessingCC++RustLarge-scale Data ProcessingLinux DevelopmentProblem-solvingCommunicationCollaborationMentorship

Required

Bachelor's degree in Computer Science, or a related field with 15+ years of relevant experience OR a Master's degree in Computer Science or a related field with 10+ years of relevant experience
10+ years of deep technical experience in developing core components of enterprise-grade database or analytics execution engines designed for large-scale data processing
Proven expertise in developing high-performance parallel implementations of data processing operators and functions on rich data types
Demonstrated experience leading teams of 10+ engineers in the design, development, and successful release of high-performance data processing engines for large production deployments
Exceptional programming skills in C, C++, and Rust
Extensive development experience in Linux environments
Strong analytical and problem-solving skills with a passion for performance optimization
Excellent communication and collaboration skills, with the ability to articulate complex technical concepts to both technical and non-technical audiences

Preferred

Significant experience developing for and understanding the internals of platforms such as Apache Spark, Apache Flink, Apache Doris, Apache Gluten, Velox, Apache DataFusion, or Apache DataFusion Comet is highly preferred

Benefits

Competitive compensation
Stock options
Comprehensive benefits package
Leadership development opportunities

Company

DataPelago

twittertwitter
company-logo
DataPelago is a software development company that offer big data analytics solutions.

H1B Sponsorship

DataPelago has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (2)
2024 (3)
2023 (7)
2022 (1)
2021 (3)

Funding

Current Stage
Growth Stage
Total Funding
$55M
Key Investors
Eclipse VenturesTaiwania Capital Management Corporation
2024-10-01Series A· $47M
2022-10-06Series A
2021-04-30Seed· $8M

Leadership Team

leader-logo
Rajan Goyal
Founder & CEO
linkedin
leader-logo
John Chirapurath
President
linkedin

Recent News

Company data provided by crunchbase