Lectures
Property Modelling for Product Ontology using Vector Embeddings driven by LLMS and OCR
Identifying entities and relationships from heterogeneous data sources in the context of technical documentation is an important part of building a knowledge database. Technical data consists of tables, raw texts and images for various products. We use pre-trained LLM and OCR models to identify products and product attributes from these sources. The extracted product information is now disambiguated using vector embeddings and mapped to specific entities and relationships in our PIM ontology. This use of AI tools helps us build a much more concrete knowledge base for our customers compared to standard data transformation approaches that only work with structured data and are rule-based.

Unlocking the Potential of Legacy Data with AI and Knowledge Graphs
Sofia Darie, Nikhil Achary. Unlocking the potential of legacy data with AI and Knowledge Graphs. Information Energy, April 2025.

From Legacy Systems to DITA: Case Studies in Successful Migration
Sofia Darie, Karsten Schrempp. From Legacy Systems to DITA: Case Studies in Successful Migration, February 2025.

Product ontologies as backbone for GenAI
Maximilian Gärber, Prof. Dr. Martin Ley. Product ontologies as backbone for GenAI. tcworld, November 2024.
Contact us
Maraike Heim
Head of Marketing
- maraike.heim@pantopix.com
