Forward Deployed Engineer, Unstructured AI jobs in United States
cer-icon
Apply on Employer Site
company-logo

Collibra · 1 month ago

Forward Deployed Engineer, Unstructured AI

Collibra is a company focused on managing unstructured data and they are seeking a Forward Deployed Engineer to join their Unstructured AI Team. The role involves owning the technical delivery of Unstructured AI deployments and building full-stack systems to process large volumes of unstructured content.

AnalyticsArtificial Intelligence (AI)Data IntegrationData ManagementEnterprise SoftwareInfrastructure
check
Comp. & Benefits
badNo H1Bnote

Responsibilities

Own end-to-end technical delivery of Unstructured AI deployments — from first prototype to stable production across enterprise environments
Build and scale full-stack systems that process and enrich large volumes of unstructured content (PDFs, contracts, reports, and other document types)
Embed closely with customer and field teams to understand their metadata, governance, and security needs - guiding how Unstructured AI integrates into their broader Collibra stack
Scope work, sequence delivery, and remove blockers early to ensure fast iteration cycles between product, research, and deployment teams
Balancing scope, speed, and quality - making clear trade-offs to keep pilots moving and convert them into production rollouts
Codifying repeatable patterns from customer projects into reusable connectors, enrichment modules, or playbooks that accelerate future deployments
Feeding field insights back to Product and Research, identifying opportunities to improve product experience
Keep cross-functional teams aligned through clear communication, prioritization, and follow-through

Qualification

PythonDocument processing systemsCloud infrastructureData pipelinesAI-driven enrichment modelsMicroservice architectureInfrastructure as CodeStakeholder managementDecision-making under pressureCommunication skills

Required

Shipped complex systems under ambiguity - balancing speed and precision in real customer environments
Written and reviewed production-grade code across backend (Python, FastAPI)
Built or deployed document-processing systems and are comfortable with CI/CD, monitoring, and debugging tools
2+ years of software engineering or technical deployment experience, ideally involving enterprise integrations, AI data processing, or customer-facing delivery
Strong proficiency in Python (data processing, API development, and integrations)
Proven ability to deliver production-grade systems that process large-scale unstructured data (PDFs, text, documents)
Solid understanding of data pipelines, microservice architecture, and API design
Experience with cloud infrastructure (AWS, GCP, or Azure), Infrastructure as Code (Terraform) and containerization (Docker / Kubernetes)
Experience with LLM-based or AI-driven enrichment models (classification, extraction, deduplication, PII detection)
Familiar with metadata systems, data cataloging, or document AI workflows
Background in data governance, sensitive data detection, or enterprise integrations (Collibra, Databricks, Snowflake, etc.)
A track record of codifying repeatable deployment patterns into tools, SDKs, or frameworks
Knowledge of security, compliance, and model evaluation best practices
A bachelor's degree or equivalent work experience is required

Preferred

Capable of communicating clearly across engineering, product, and field teams, ensuring alignment from prototype to rollout
Experienced in spotting risks early, course-correcting without friction, and model composure when delivery timelines are tight
Someone who cares deeply about data quality, precision, and governance
Willing to gain hands-on experience with modern frontend development
Able to translate customer requirements into technical plans and deliver end-to-end
Strong communication and stakeholder-management skills across technical and business teams
Calm, structured decision-making under tight timelines or ambiguity

Benefits

Equity ownership at every level
Bonus potential
Flex Fund monthly stipend
Pension/401k plans
Competitive compensation
Health coverage
Time off

Company

Collibra

company-logo
Collibra delivers an end-to-end Data Intelligence platform to accelerate digital business transformation.

Funding

Current Stage
Late Stage
Total Funding
$596.52M
Key Investors
CapitalGICONIQ GrowthIndex Ventures
2022-01-11Series Unknown
2021-11-09Series G· $250M
2020-04-02Series F· $112.5M

Leadership Team

leader-logo
Felix Van De Maele
Founder, CEO
linkedin
leader-logo
Stijn Christiaens
Co-founder & Chief Data Citizen
linkedin
Company data provided by crunchbase