Lectures
March 21, 2024
Property Modelling for Product Ontology using Vector Embeddings driven by LLMS and OCR
Identifying entities and relationships from heterogeneous data sources in the context of technical documentation is an important part of building a knowledge database. Technical data consists of tables, raw texts and images for various products. We use pre-trained LLM and OCR models to identify products and product attributes from these sources. The extracted product information is now disambiguated using vector embeddings and mapped to specific entities and relationships in our PIM ontology. This use of AI tools helps us build a much more concrete knowledge base for our customers compared to standard data transformation approaches that only work with structured data and are rule-based.
Subscribe to the free newsletter from PANTOPIX.
We will gladly keep you informed regularly about new webinars.
What Makes Intelligent Content Indispensable? An Expert-led Roundtable Discussion
Learn in this webinar with Karsten Schrempp, why intelligent content authoring is indispensable.
Knowledge Graph Embeddings in the Industry
In this webinar, Nikhil Acharya will introduce you why knowledge graphs are extremely useful when we need to compare hierarchical relationships, properties and links of different data models.
Delivery-First Approach: The Key to Faster ROI in Content Projects
Learn how Dynamic Content Delivery generates added value from your existing content, regardless of the tools and methods currently being used.
Contact us
Prof. Dr. Martin Ley
Senior Consultant
- martin.ley@pantopix.com