Property Modeling for Product Ontology

21. March 2024

Identifying entities and relationships from heterogeneous data sources in the context of technical documentation is an important part of building a knowledge database.

Technical data consists of tables, plain text, and images related to various products. We use pre-trained LLM and OCR models to identify products and product attributes from these sources. The extracted product information is now disambiguated using vector embeddings and mapped to specific entities and relationships in our PIM ontology.

This use of AI tools helps us to build a much more concrete knowledge base for our customers compared to standard data transformation approaches that only work with structured data and are rule-based.

We will be happy to inform you regularly about new articles, videos or podcast episodes.

Knowledge

Learn more about connecting information

Records

18. May 2026

Do We Still Need Metadata in the Age of AI?

In this webinar, Karsten Schrempp (PANTOPIX) and Jörg Schmidt (RWS) will show why metadata remains essential in technical communication today.

View article

Records

16. November 2024

Knowledge Graph Embeddings in the Industry

View article

Records

21. March 2024