Taming Data Explosion in Probabilistic Information Integration


Keijzer de, Ander and Keulen van, Maurice and Li, Yiping (2006) Taming Data Explosion in Probabilistic Information Integration. In: International Workshop on Inconsistency and Incompleteness in Databases, IIDB 2006, 26 Mar 2006, Munich, Germany.

Abstract:Data integration has been a challenging problem for decades. In autonomous data integration, i.e., without a user to solve semantic uncertainty and conflicts between data sources, it even becomes a serious bottleneck. A probabilistic approach seems promising as it does not require extensive semantic annotations nor user interaction at integration time. It simply teaches the application how to generically cope with uncertainty. Unfortunately, without any world knowledge, uncertainty abounds as almost everything becomes (theoretically) possible and maintaining all possibilities produces huge volumes of data. In this paper, we claim that simple and generic knowledge rules are sufficient to drastically reduce uncertainty, hence tame data explosion to a manageable size.
Item Type:Conference or Workshop Item
Science and Technology (TNW)
Electrical Engineering, Mathematics and Computer Science (EEMCS)
Research Group:
Link to this item:http://purl.utwente.nl/publications/66509
Official URL:ftp://ftp.umh.ac.be/pub/ftp_ssi/jef/iidbproc.pdf
Export this item as:BibTeX
HTML Citation
Reference Manager


Repository Staff Only: item control page

Metis ID: 238693