SIGN IN
Software Engineer, Data Foundations jobs in United States
cer-icon
Apply on Employer Site
company-logo

Glean · 16 hours ago

Software Engineer, Data Foundations

Glean is an innovative AI-powered knowledge management platform designed to help organizations quickly find, organize, and share information across their teams. The Software Engineer will join the Data Foundations team, responsible for managing the data ingestion and management layer that powers Glean’s products, ensuring data quality and performance for enterprise applications.
Artificial Intelligence (AI)Enterprise SoftwareAgentic AIGenerative AIMachine LearningSearch Engine
check
Work & Life Balance
check
H1B Sponsor Likelynote

Responsibilities

Build and scale connectors to a wide variety of SaaS and on-prem systems (Google Workspace, Microsoft 365, Slack, Salesforce, Jira, ServiceNow, GitHub, etc.)
Handle full syncs, low-latency incremental updates via webhooks/APIs, rate-limiting, and complex authentication flows
Build advanced capabilities in datasources like actions, live-fetch, and query language support
Transform raw, unstructured enterprise content into rich, structured, permission-aware representations optimized for search and LLM reasoning
Design document schemas and enrichment pipelines (entity extraction, access-graph propagation, redactions, etc.)
Expand the capabilities of AI products through deep integrations that allow us to automate tasks, perform complex queries grounded in enterprise data, and enhance our indexed corpus with live data
Own end-to-end correctness, freshness, and performance for petabyte-scale data flows
Solve hard problems in ordering, idempotency, exactly-once processing, backpressure, and retries across distributed queues, workers, and storage
Preserve fine-grained ACLs, deletions, and sensitivity constraints so AI answers are always grounded in what users are actually allowed to see
Partner closely with Search Serving, Product, Platforms, and Security teams to define how enterprise context is exposed to LLMs and agents
Continuously improve observability, alerting, and automation to onboard larger customers and more data sources with confidence

Qualification

Data infrastructure systemsDistributed systemsData pipelinesLarge-scale storageJavaGoC++PythonSLOsError budgetsFailure modesCorrectness guaranteesEnterprise connectorsSearch/indexingInformation retrievalSecurity-sensitive systemsLLMsAI tools

Required

3+ years building production backend or data infrastructure systems (Java, Go, C++, Python, etc.)
Hands-on experience with distributed systems, data pipelines, queues, and large-scale storage (SQL/NoSQL)
You think in SLOs, error budgets, failure modes, and correctness guarantees — not just features
Comfortable with strict consistency and permission-modeling challenges
Prior work on enterprise connectors, search/indexing, information retrieval, or security-sensitive systems is a strong plus
Passionate about making AI trustworthy by building the rock-solid data foundation underneath it
Power user of LLMs and AI tools in your own workflow

Benefits

Medical, Vision, and Dental coverage
Generous time-off policy
Opportunity to contribute to your 401k plan
Home office improvement stipend
Annual education and wellness stipends
Healthy lunches daily

Company

Glean

twittertwittertwitter
company-logo
Glean develops an AI-based search engine software that connects enterprise data and generates answers to improve workplace efficiency.

H1B Sponsorship

Glean has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (22)
2024 (17)
2023 (7)
2022 (26)

Funding

Current Stage
Late Stage
Total Funding
$768.2M
Key Investors
Wellington ManagementAltimeter,DST GlobalKleiner Perkins,Lightspeed Venture Partners
2025-06-10Series F· $150M
2024-09-10Series E· $260M
2024-02-27Series D· $203.2M

Leadership Team

leader-logo
Arvind Jain
Founder and CEO
linkedin
leader-logo
Michael Miao
VP of Finance & BizOps
linkedin
Company data provided by crunchbase