Information Systems Group

― Knowledge Representation and Reasoning

LogMap: Logic-based Methods for Ontology Mapping

An ISG project in Knowledge Representation and Reasoning

Description

Ontologies are extensively used in biology and medicine. A prominent example of a bio-medical ontology is SNOMED CT, which is a core component of the NHS patient record service. Other examples include the Foundational Model of Anatomy (FMA) and the National Cancer Institute Thesaurus (NCI). Ontologies such as SNOMED CT, FMA, and NCI are gradually superseding the existing medical classifications and are becoming core platforms for accessing, gathering, and sharing medical knowledge and data.

To exchange or migrate data between ontology-based applications, it is crucial to establish correspondences (or mappings) between their ontologies. Creating such mappings manually is often unfeasible due to the size and complexity of modern ontologies. Therefore, the problem of automatically generating mappings between ontologies (often referred to as the ontology matching, ontology alignment, or ontology mapping problem) has been investigated extensively in recent years. Despite the already mature state of the art, bio-medical ontologies still pose serious challenges to existing techniques.

First, carefully-curated mapping sets used in bio-medical information integration, such as UMLS Metathesaurus, often contain errors and lack important information. Second, existing mapping generation tools do not scale to the size of modern bio-medical ontologies. Finally, the developers of bio-medical information integration and migration systems based on ontologies currently lack the necessary tool support.

In this research, we aim to address each of these needs. Therefore, we have identified three main objectives:

  1. To develop general principles and specific techniques to efficiently detect and repair potential errors and discover missing information in manually-curated mapping sets, such as the UMLS Metathesaurus.

  2. To develop computationally efficient algorithms for generating mappings between large-scale ontologies, while minimising the number of errors and the amount of missing information.

  3. To design and implement a prototype with core functionality for mapping generation and management, as well as for the translation and migration of ontology data.

Our main research hypothesis is based on the observation that existing techniques for ontology mapping often disregard the logic-based semantics of the input ontologies. As a result, they fail to take advantage of the available semantics, and of the highly effective reasoning services for modern ontology languages. We are proposing to rethink the foundations underlying the current state-of-the art in the field by incorporating reasoning-based techniques in each of the steps of the ontology mapping process. We also intend to go even further and make our techniques practical and ready to be used in applications. To this end, we aim to develop suitable heuristics that can be efficiently implemented. The research is based on our preliminary empirical evidence which suggests the potential benefits of logic-based heuristic techniques when analysing existing mappings between biomedical ontologies.

We expect that our results will be directly relevant to the users of ontology-based systems in the bio-medical domain, where knowledge and data integration is a matter of major concern.


RESOURCES


Used Ontologies:


UMLS Repair (includes original and repaired UMLS mappings)


LogMap Extraction (includes computed mappings and ontology overlapping). Output from OAEI 2012.

  • Coming soon

LogMap Extraction (includes computed mappings and ontology overlapping). September 2011.


Support

LogMAP is sponsored by the UK Engineering and Physical Sciences Research Council (EPSRC).

Key Publications

[View all]

Ernesto Jimenez Ruiz‚ Bernardo Cuenca Grau‚ Yujiao Zhou and Ian Horrocks
Large−scale Interactive Ontology Matching: Algorithms and Implementation. [.pdf]
In the 20th European Conference on Artificial Intelligence (ECAI 2012).

Ernesto Jiménez-Ruiz, Bernardo Cuenca Grau
LogMap: Logic-based and Scalable Ontology Matching. [.pdf]
In the 10th International Semantic Web Confernece (ISWC 2011)

Ernesto Jiménez-Ruiz, Bernardo Cuenca Grau, Ian Horrocks
On the Feasibility of Using OWL 2 DL Reasoners for Ontology Matching Problems. [.pdf]
In the 1st OWL Reasoner Evaluation Workshop (ORE 2012)

Ernesto Jiménez-Ruiz, Antón Morant, Bernardo Cuenca Grau.
LogMap results for OAEI 2011. [.pdf]
In the 6th International Workshop on Ontology Matching (OM 2011)

Ernesto Jiménez-Ruiz, Bernardo Cuenca Grau.
Towards more Challenging Problems for Ontology Matching Tools. [.pdf] [poster]
In the 6th International Workshop on Ontology Matching (OM 2011)

Ernesto Jiménez-Ruiz, Bernardo Cuenca Grau, Yujiao Zhou
LogMap 2.0: towards logic-based, scalable and interactive ontology matching. [.pdf] [poster]
In the 4th International Workshop on Semantic Web Applications and Tools for Life Sciences (SWAT4LS 2011)

Technical Reports

Antón Morant. Extending and Optimizing an Ontology Matching System. [.pdf]
Master thesis. Oxford University Department of Computer Science. September 2011. Suvervised by Bernardo Cuenca Grau and Ernesto Jiménez-Ruiz

Previous Related Work

Ernesto Jiménez-Ruiz, Bernardo Cuenca Grau, Ian Horrocks, and Rafael Berlanga Llavori.
Ontology Integration Using Mappings: Towards Getting the Right Logical Consequences.
In Proc. of the 6th European Semantic Web Conf. (ESWC 2009), Volume 5554 of Lecture Notes in Computer Science, pages 173-187.
BibTeX-Entry | Pdf ]

Ernesto Jiménez-Ruiz, Bernardo Cuenca Grau, Rafael Berlanga Llavori, and Ian Horrocks. Logic-based Assessment of the Compatibility of UMLS Ontology Sources. In BMC Journal of Biomedical Semantics 2011, 2(Suppl 1):S2, ISSN 1613-0073. (http://www.jbiomedsem.com/content/2/S1/S2), 2011.
[ bib | .pdf ]