Octagon · 2 months ago
Data Engineer (AI Enablement)
Octagon is a global sports, entertainment, and experiential marketing company recognized as one of the “Best Places to Work in Sports”. They are seeking a Data Engineer (AI Enablement) responsible for building and operating data foundations that support AI solutions and enterprise search, collaborating with various teams to enhance data quality and enable AI-powered workflows.
Advertising · Brand Marketing
Responsibilities
Design and operate the vector database/search layer (e.g., FAISS/pgvector/Milvus) and document-chunking/embedding pipelines that make Octagon’s content discoverable and auditable
Implement and maintain ELT/ETL to support downstream workflows such as data labeling, classification, and document parsing; build robust validations, lineage, and observability
Expose governed retrieval endpoints that respect permissions (ACLs), support metadata filters, and return source snippets/IDs for grounding and citations
Normalize, transform, and move JSON and other structured payloads cleanly through workflows to ensure reliable handoffs and automation outputs
Align product peers, design, data science, engineering, and commercial teams around a unified roadmap and shared data contracts
Take MVPs from the Solutions Engineer and productionize them with CI/CD, telemetry, cost/usage guardrails, and pilot → rollout gating
Build monitoring (freshness, re-index SLAs, retrieval quality), secrets management, access controls, and audit logging aligned with enterprise governance
Be flexible and willing to travel and work weekends or holidays as needed
Qualifications
Required
3+ years (or equivalent portfolio) building data systems: data modeling, ELT/ETL, Python + SQL; experience with cloud object storage and relational databases
Hands-on experience with embeddings and vector databases (e.g., FAISS/pgvector/Milvus) and document processing pipelines for RAG-style retrieval
Scalable pipeline experience supporting AI/ML/LLM use cases (labeling, classification, doc parsing) and partnering closely with Data Science and Data Labeling teams
Data structuring & manipulation expertise: cleanly normalizing and transforming JSON/Parquet/CSV payloads; designing resilient data contracts and schemas
Orchestration/ops: Airflow/Prefect (or similar), CI/CD, structured logging/monitoring, cost/usage guardrails; secure secrets management
Strong collaboration and communication skills; proven ability to align product/design/engineering/commercial stakeholders around a unified roadmap
Preferred
Enterprise connectors and productivity stacks (e.g., Microsoft 365/SharePoint/Teams/Graph, Copilot or Copilot Studio/Power Automate; Google Workspace; Salesforce; DAMs)
Experience implementing LLM inference patterns, similarity search, guardrails, and memory; familiarity with agent frameworks or custom orchestration
Additional languages for systems work (e.g., C++, C#, Java, or Go)
Containers (Docker), GitHub Actions, IaC; lightweight internal UIs (Streamlit or R Shiny) to expose services
Familiarity with marketing/media-measurement datasets and associated normalization/quality checks
Benefits
Unlimited PTO policy – we understand you need time for play!
Competitive medical/dental/vision insurance plans with FSA/HSA and Dependent Care FSA options. Pet Insurance for those who need it too!
Generous Family and Parental Leave Policy (12 weeks) with eligibility extended to all parents regardless of gender or primary/secondary caregiver status
Access to our parent company's (IPG) Savings Plan (401(k) program) with company match, as well as an Employee Stock Purchase Plan (ESPP)
Pretax Transportation/Commuter Benefits and Parent Travel Program
Dedicated Mental Health resources including Headspace membership, Employee Assistance Program (CCA) and more
Discount portal for everyday goods and services
Employee Resource Groups and inclusive diversity programming and initiatives
Personal Development programs
Company
Octagon
Octagon is the world's leading sports and entertainment management and marketing company. It is a subsidiary of Interpublic Group (IPG).
Funding
Current Stage
Late Stage
Recent News
Sports Business Journal
2025-10-11
Sports Business Journal
2025-10-09
Company data provided by Crunchbase