Principal Software Engineer, Data Platform jobs in United States
info-icon
This job has closed.
company-logo

griddable.io · 1 day ago

Principal Software Engineer, Data Platform

Griddable.io is seeking a Principal Software Engineer for their Data Platform, who will serve as the primary technical architect responsible for modernizing and optimizing Salesforce's foundational data ecosystem. The role involves architecting the Enterprise Data Platform, integrating advanced technologies, and mentoring engineering teams to enhance technical standards and performance.

AnalyticsBig DataCloud Data ServicesData IntegrationInformation TechnologySaaSSoftware

Responsibilities

Architect the Roadmap: Define the long-term technical architecture for the Enterprise Data Platform. Translate business strategy into technical specifications, ensuring our stack allows for 'Data Mesh' scalability and domain-oriented ownership
Infrastructure as Code (IaC) Evangelism: personally architect and review the Terraform/Helm configurations that define our infrastructure. Ensure that from Snowflake RBAC to Neo4j clusters, our platform is immutable, version-controlled, and reproducible
Performance Engineering: Deep dive into the hardest performance bottlenecks. Optimize query planners, data serialization formats (Parquet/Iceberg), and distributed compute costs across Snowflake and Spark
AI Enablement: Design the integration patterns for AI-assisted tooling (Cursor, MCP, Copilot) within the developer workflow to step-change developer velocity
Graph RAG Architecture: Lead the technical design of 'Graph RAG' (Retrieval-Augmented Generation), creating the patterns that allow LLM agents to query structured Snowflake data via the Neo4j Knowledge Graph
Semantic Layer Design: Design the integration between the physical data layer (Snowflake) and the semantic governance layer (TopQuadrant/TopBraid EDG), ensuring ontologies are mechanically enforced rather than theoretically defined
Polyglot Persistence: define the specific architectural patterns for when data should reside in a Relational Store (Snowflake) versus a Graph Store (Neo4j), and design the high-velocity pipelines (Kafka/Airflow) that keep them in sync
Code Quality & DevOps: Set the standard for code quality. You will be expected to code, review Pull Requests, and enforce strict CI/CD pipelines (unit testing data, schema validation)
Resiliency Architecture: Design self-healing systems. Architect the monitoring and alerting frameworks (SRE) that ensure 99.9% availability for critical pipelines
Mentorship without Authority: Act as a technical mentor to Senior and Lead engineers across multiple squads. Elevate the technical bar of the organization through design reviews, RFCs, and pair programming sessions

Qualification

SnowflakeNeo4jData ArchitecturePythonAirflowAWS/GCPGraph TheoryCI/CDJavaGoInformaticaTableauCommunicationMentorship

Required

10+ years of software engineering experience, with at least 5 years focused on backend distributed systems or data infrastructure at scale
Deep Engineering Roots: You are an expert coder (Python, Java, or Go) who grew up building software. You are comfortable debugging a distributed trace, optimizing a JVM heap, or rewriting a slow SQL query plan
Architectural Expertise: Proven track record of designing large-scale data platforms. You understand the CAP theorem, eventual consistency, and the trade-offs between batch and streaming architectures
Core Stack Mastery: Hands-on expert-level knowledge of Snowflake (internals/clustering), dbt (macro design/Jinja), Airflow (scheduler internals), and Tableau
Graph Database Expertise: Deep understanding of Graph theory and implementation. You know how to model data in Neo4j (Cypher) to avoid super-node problems and optimize traversal performance
Cloud Native Native: Mastery of AWS/GCP services (IAM, VPC, PrivateLink, S3/GCS) and container orchestration (Kubernetes/EKS)
AI/LLM Integration: Experience implementing RAG architectures, vector databases, or integrating LLMs into data pipelines
Influence & Communication: Ability to write clear, persuasive Request for Comments (RFCs) and architectural decision records (ADRs) that drive consensus among other architects and engineering leadership

Company

griddable.io

twittertwitter
company-logo
Griddable.io is a San Jose, CA based SaaS startup that closed Series A funding in 2017 from August Capital, Artiman Ventures, and Carsten Thoma, founding CEO of Hybris (acquired by SAP).

Funding

Current Stage
Early Stage
Total Funding
$8M
2019-01-28Acquired
2018-02-28Series A· $8M

Leadership Team

leader-logo
Burton Hipp
VP of Engineering/Founder
linkedin
Company data provided by crunchbase