OAEI 2012::Large BioMed Track

Results OAEI 2012::Large BioMed Track

Results OAEI 2012 SNOMED-NCI matching problem

The matching outputs in the SNOMED-NCI matching problem have only been compared against the original UMLS mapping and the refined subset computed by LogMap's repair facility. We could not compute a refined UMLS alignment set with Alcomo debugging system since, at the time of creating the datasets, it could not cope with the integration of SNOMED and NCI via mappings. The new version of Alcomo, however, has shown to be able to provide such refined set.

The satisfiability results, since currently no OWL 2 reasoner has shown to cope with the integration of SNOMED and NCI via mappings [url], have been estimated using the Dowling-Gallier algorithm [url] for propositional Horn satisfiability (implemented in LogMap's repair facility).

SNOMED-NCI small fragments

The SNOMED-NCI matching problem moves to a next level of difficulty with respect to the FMA-SNOMED matching problem and, in general, runtimes and results are slightly worse. Furthermore, Hertuda and HotMatch, which were able to complete the small FMA-NCI and the small FMA-SNOMED tasks, failed to complete the small SNOMED-NCI task in less than 24 hours.

Six systems provided an F-measure higher than our baseline LogMapLt and their F-measures were very close to each other. On the other hand, GOMMA, MapSSS and AROMA failed to top LogMapLt results. LogMap-noe provided the best results in terms of recall and F-measure while ServOMap generated the most precise mappings.

As in the FMA-NCI and FMA-SNOMED matching problems, precision tend to increase when comparing against the original UMLS mapping set, while recall decreases.

The runtimes were also positive in general and 7 systems completed the task in less than 4 minutes. YAM++ required more than 30 minutes, while AROMA and MapSSS needed 4 and 8 hours to complete the task, respectively.

LogMap (with its two variants) generated a set of output mappings that did not lead to any unsatisfiable class when reasoning (using Dowling-Gallier algorithm) together with the input ontologies. The rest of the systems generated mapping sets that lead to a degree of incoherence greater than 50%.

System	Time (s)	# Mappings	Original UMLS			Refined UMLS (LogMap)			Average			Incoherence Analysis
System	Time (s)	# Mappings	Precision	Recall	F-measure	Precision	Recall	F-measure	Precision	Recall	F-measure	All Unsat.	Degree	Root Unsat.
LogMap-noe	211	13,525	0.897	0.644	0.750	0.893	0.659	0.758	0.895	0.652	0.754	0	0%	0
LogMap	221	13,454	0.899	0.642	0.749	0.895	0.657	0.758	0.897	0.649	0.753	0	0%	0
GOMMA_Bk	226	12,294	0.946	0.617	0.747	0.931	0.625	0.748	0.939	0.621	0.747	48,681	64.83%	863
YAM++	1,901	11,961	0.951	0.604	0.739	0.940	0.614	0.743	0.946	0.609	0.741	50,089	66.71%	471
ServOMapL	147	11,730	0.960	0.598	0.737	0.947	0.606	0.739	0.954	0.602	0.738	62,367	83.06%	657
ServOMap	153	10,829	0.972	0.558	0.709	0.959	0.567	0.713	0.965	0.563	0.711	51,020	67.95%	467
LogMapLt	54	10,947	0.953	0.554	0.700	0.938	0.560	0.701	0.945	0.557	0.701	61,269	81.60%	801
GOMMA	197	10,555	0.948	0.531	0.680	0.931	0.536	0.680	0.939	0.533	0.680	42,813	57.02%	851
AROMA	15,624	11,783	0.861	0.538	0.662	0.848	0.545	0.664	0.854	0.542	0.663	70,491	93.88%	1,286
MapSSS	27,381	9,608	0.795	0.405	0.537	0.783	0.411	0.539	0.789	0.408	0.538	46,083	61.37%	794

SNOMED-NCI big fragments

MapSSS and AROMA failed to complete the task involving the big fragments of FMA and SNOMED after more than 24 hours of execution.

There were not big differences, in general, in terms of F-measure with respect to the small SNOMED-NCI task. Only LogMap decreased their recall and lost its second position and GOMMA-bk generated less precise mappings and was relegated to the sixth position. As in previous task, LogMap-noe provided the best results in terms of recall and F-measure while ServOMap generated the most precise mappings.

Runtimes were between 2 and 3 orders of magnitude bigger than in the small task, but in the most of the cases the task was finished in less than 10 minutes.

Regarding mapping coherence, LogMap-noe provided a clean output while LogMap, since it computes an estimation of the overlapping (fragments) between the input ontologies, failed to detect and repair 3 unsatisfiable classes, which were outside the computed ontology fragments.

System	Time (s)	# Mappings	Original UMLS			Refined UMLS (LogMap)			Average			Incoherence Analysis
System	Time (s)	# Mappings	Precision	Recall	F-measure	Precision	Recall	F-measure	Precision	Recall	F-measure	All Unsat.	Degree	Root Unsat.
LogMap-noe	575	13,184	0.882	0.617	0.726	0.877	0.631	0.734	0.879	0.624	0.730	0	0%	0
YAM++	6,127	13,083	0.864	0.600	0.708	0.854	0.610	0.712	0.859	0.605	0.710	104,492	60.66%	618
ServOMapL	363	12,784	0.870	0.590	0.703	0.858	0.599	0.705	0.864	0.594	0.704	136,909	79.48%	1,101
LogMap	514	12,142	0.877	0.565	0.687	0.872	0.578	0.695	0.874	0.571	0.691	3	0.002%	2
ServOMap	282	11,632	0.896	0.553	0.684	0.885	0.562	0.687	0.891	0.558	0.686	110,253	64.00%	820
GOMMA_Bk	638	15,644	0.730	0.606	0.662	0.718	0.613	0.662	0.724	0.610	0.662	116,451	67.60%	2,741
LogMapLt	104	12,741	0.819	0.553	0.660	0.805	0.560	0.661	0.812	0.557	0.661	131,073	76.09%	2,201
GOMMA	527	12,320	0.802	0.524	0.634	0.787	0.529	0.633	0.795	0.527	0.634	96,945	56.28%	1,621
AROMA	-	-	-	-	-	-	-	-	-	-	-	-	-	-
MapSSS	-	-	-	-	-	-	-	-	-	-	-	-	-	-

SNOMED-NCI whole ontologies

The precision and recall slightly decreased in all systems and none of them could reach an F-measure of 0.7. YAM++ produced the best mapping set in terms of F-measure, while ServOMap and GOMMA-bk generated the mappings with best precision and recall, respectively. LogMap-noe lost its first position since it provided less comprehensive mappings.

ServOMap, ServOMapL and LogMapwere the fastest tools and required 11, 12 and 16 minutes respectively. GOMMA (with its two variations) required more than 30 minutes, while YAM++ required more than 8 hours.

As in previous task, LogMap-noe provided a clean output while LogMap failed to detect and repair a few unsatisfiable classes due to the computation of the overlapping between the input ontologies.

System	Time (s)	# Mappings	Original UMLS			Refined UMLS (LogMap)			Average			Incoherence Analysis
System	Time (s)	# Mappings	Precision	Recall	F-measure	Precision	Recall	F-measure	Precision	Recall	F-measure	All Unsat.	Degree	Root Unsat.
YAM++	30,155	14,103	0.794	0.594	0.680	0.785	0.604	0.683	0.790	0.599	0.681	238,593	63.91%	979
ServOMapL	738	13,964	0.796	0.590	0.678	0.785	0.598	0.679	0.791	0.594	0.678	286,790	76.82%	1,557
LogMap	955	13,011	0.816	0.564	0.667	0.812	0.577	0.674	0.814	0.570	0.671	16	0.004%	10
LogMap-noe	1,505	13,058	0.813	0.563	0.666	0.809	0.577	0.673	0.811	0.570	0.670	0	0%	0
ServOMap	654	12,462	0.835	0.552	0.664	0.824	0.560	0.667	0.829	0.556	0.666	230,055	61.63%	1,546
GOMMA_Bk	1,940	17,045	0.669	0.605	0.635	0.658	0.612	0.634	0.663	0.608	0.635	239,708	64.21%	4,297
LogMapLt	178	14,043	0.743	0.553	0.634	0.731	0.560	0.634	0.737	0.557	0.634	305,648	81.87%	3,160
GOMMA	1,820	13,693	0.720	0.523	0.606	0.707	0.528	0.605	0.714	0.526	0.606	215,959	57.85%	2,614
AROMA	-	-	-	-	-	-	-	-	-	-	-	-	-	-
MapSSS	-	-	-	-	-	-	-	-	-	-	-	-	-	-