Skip to main content

Ontology−Mediated Queries for Probabilistic Databases

Stefan Borgwardt‚ İsmail İlkan Ceylan and Thomas Lukasiewicz

Abstract

Probabilistic databases (PDBs) are usually incomplete, e.g., containing only the facts that have been extracted from the Web with high confidence. However, missing facts are often treated as being false, which leads to unintuitive results when querying PDBs. Recently, open-world probabilistic databases (OpenPDBs) were proposed to address this issue by allowing probabilities of unknown facts to take any value from a fixed probability interval. In this paper, we extend OpenPDBs by Datalog+/– ontologies, under which both upper and lower probabilities of queries become even more informative, enabling us to distinguish queries that were indistinguishable before. We show that the dichotomy between P and PP in (Open)PDBs can be lifted to the case of first-order rewritable positive programs (without negative constraints); and that the problem can become NP^PP-complete, once negative constraints are allowed. We also propose an approximating semantics that circumvents the increase in complexity caused by negative constraints.

Book Title
Proceedings of the 31st National Conference on Artificial Intelligence‚ AAAI 2017‚ San Francisco‚ California‚ USA‚ February 4–9‚ 2017
Editor
Satinder Singh and Shaul Markovitch
Pages
1063–1069
Publisher
AAAI Press
Year
2017