Software Engineer, Enterprise Data Platform jobs in United States

Notion · 15 hours ago

Software Engineer, Enterprise Data Platform

Notion is a company that provides a unified platform for teams to enhance productivity by connecting various tools and applications. They are seeking a Software Engineer for their Data Platform team to design and build a core data platform that supports AI, analytics, and search while ensuring security and compliance. The role involves working on data pipelines, lakehouse components, and improving the overall reliability and performance of the data infrastructure.

Apps, Collaboration, Product Management, Real Time, Software
H1B Sponsor Likely

Responsibilities

Design and evolve the data lakehouse
Build and operate core lakehouse components (e.g., Iceberg/Hudi/Delta tables, catalogs, schema management) that serve as the source of truth for analytics, AI, and search
Own critical data pipelines and services
Design, implement, and harden batch and streaming pipelines (Spark, Kafka, EMR, etc.) that move and transform data reliably across regions and cells
Advance EKM and encryption-by-design
Work with Security and platform teams to integrate Enterprise Key Management (EKM) into data workflows, including file- and record-level encryption and safe key handling in Spark and storage systems
Improve data access, auditability, and residency
Build primitives for fine-grained access control, auditing, and data residency so customers can see who accessed what, where, and under which guarantees
Drive reliability and observability
Raise the operational bar for our data stack: improve on-call experience, debugging, and alerting for data jobs and services
Optimize large-scale performance and cost
Tackle performance and cost challenges across Kafka, Spark, and storage for very large workspaces (20k+ users, multi-cell deployments), including cluster migrations and workload tuning
Enable ML and search workflows
Build infrastructure to support training and inference pipelines, ranking workflows, and embedding infrastructure on top of the shared data platform
Shape the platform roadmap
Contribute to design docs and evaluations that influence our long-term platform direction and vendor choices

Qualifications

Data platform experience, Python, Spark, Kafka, Cloud infrastructure, SQL, Data lakes, Access control, Encryption, Incident response, Reliability improvements, Soft skills

Required

5+ years building and operating data platforms or large-scale data infrastructure for SaaS or similar environments
Strong skills in at least one of Python, Java, or Scala; comfortable working with SQL for analytics and data modeling
Hands-on experience with Spark or similar distributed processing systems, including debugging and performance tuning
Experience with Kafka or equivalent streaming systems; familiarity with CDC/ingestion patterns (e.g., Debezium, Fivetran, custom connectors)
Experience with data lakes and table formats (Iceberg, Hudi, or Delta) and/or data catalogs and schema evolution
Practical understanding of access control, encryption at rest/in transit, and auditing as they apply to data platforms
Experience with at least one major cloud provider (AWS, GCP, or Azure) and managed data/compute services (e.g., EMR, Dataproc, Kubernetes-based compute)
Comfortable owning services and pipelines in production, including on-call, incident response, and reliability improvements

Preferred

Experience working directly with enterprise customers or on features like data residency, EKM, or compliance-driven auditing
Prior work on Databricks, Unity Catalog, Lake Formation, or similar catalog/governance systems
Background implementing multi-region / multi-cell data architectures
Experience building ML training/eval workflows or model/feature stores on top of a shared data platform
Familiarity with vector databases or search infrastructure, and how they integrate with upstream data systems
Experience designing or improving observability for data platforms (e.g., Honeycomb, OpenTelemetry, metrics/trace-heavy debugging)

Company

Notion

Notion is a workspace platform that offers note-taking, collaboration, task management, wikis, and databases.

H1B Sponsorship

Notion has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for your reference. (Data powered by US Department of Labor)
Trends of Total Sponsorships
2025 (31)
2024 (23)
2023 (9)
2022 (22)
2021 (7)
2020 (2)

Funding

Current Stage
Late Stage
Total Funding
$343.2M
Key Investors
Index Ventures, First Round Capital
2025-12-15 · Secondary Market
2021-10-08 · Series C · $275M
2020-04-02 · Series B · $50M

Leadership Team

Akshay Kothari
COO
Ha Nguyen
Global Head of Demand Generation
Company data provided by Crunchbase