MOKA is a MSCA Postdoctoral Fellowships in collaboration with ORKG group. The goal of this project is to design and implement methods for managing the evolution of large and dynamic knowledge graphs. To do so, we will (i) identify and characterize changes that occurs in KG when the domain evolves; (ii) define a mechanism to represent the evolution of a domain within a so-called Historical Knowledge Graph (HKG); (iii) explore the potential of LLMs to support the maintenance of large KGs.
Live languages continuously evolve to integrate the cultural change in human societies. This project use Knowledge Graphs and LLM to understanding and characterise the changes in the meaning of words in order to support the automatization of tasks like translation, sentiment analysis, question-answering, etc. We formally define three classes of characterisations and we implement algorithms to detect them: (1) If the meaning of a word becomes more general or narrow (change in dimension). (2) If the word is used in a more pejorative or positive/ameliorated sense (change in orientation), and (3) if there is a trend to use the word in a, for instance, metaphoric or metonymic context (change in relation).
Creating graphs is key way of digesting knowledge and obtaining a snap shot of evolution of the subject concerned. This is the focus of LIST’s Historical Knowledge Graph for COVID-19 project, or HKG4COVID for short. The idea of the project and LIST’s contribution is to add the temporal dimension of the disease to graphs. Currently all the graphs that exist just represent current knowledge situation. We do not have the global understanding, and temporal evolution understanding of that knowledge over time, so this is what we will add.
Dynaccurate is a new Artificial Intelligence Web based application that solves the issues caused by the constant evolution of terminologies, helping data dependent organisations to optimise data exploitability. Dynaccurate is the product of seven years of dedicated research and engineering carried out by the Luxembourgish Institute of Science and Technology (LIST) into resolving the issue of how legacy database information can be easily, efficiently and accurately updated, so that older data can be continuously updated with new annotations, preserving the value of the data and reducing or eliminating inaccuracies or out-of-date references.
The efficient management and exploitation of digital information is pushing companies to rely on Semantic Web technologies. Ontologies offer the means to make the semantics of data explicit by annotating available data with concept labels that make it possible for machines to understand the annotated data. This is the case, for instance, in the health sector where patient data stored in electronic health records (EHRs) are associated with concept codes or terms borrowed from standard controlled terminologies such as the International Classification of Diseases (ICD) or SNOMED CT, facilitating data exchange between health professionals. However, the dynamic nature of domain knowledge forces engineers to revise the content of ontologies, creating a mismatch between the definition of concepts and the annotations, thus preventing any intelligent exploitation of the data. In this context, and in direct line with the results of the DynaMO project, ELISA will develop innovative concepts and tools to: Understand and characterize the evolution of ontologies over time with respect to the problem of semantic annotation evolution, Maintain the semantic annotations impacted by the evolution of ontologies they derived from. Two cases will be distinguished: - A direct modification if the annotations are modifiable, - An ad-hoc modification if the annotations are not accessible. This will be done through the design of a query enrichment mechanism reflecting the evolution of ontology in order to keep annotated data searchable over time. The proposed technology will help companies in managing the ever-increasing quantity of data they have to deal with. Moreover, it will be implemented in two real cases borrowed from the field of life sciences.