Taming Data Explosion in Probabilistic Information Integration


Keijzer, Ander de and Keulen, Maurice van and Li, Yiping (2006) Taming Data Explosion in Probabilistic Information Integration. In: International Workshop on Inconsistency and Incompleteness in Databases, IIDB 2006, 26 Mar 2006, Munich, Germany (pp. pp. 82-86).

open access
Abstract:Data integration has been a challenging problem for decades. In autonomous data integration, i.e., without a user to solve semantic uncertainty and conflicts between data sources, it even becomes a serious bottleneck. A probabilistic approach seems promising as it does not require extensive semantic annotations nor user interaction at integration time. It simply teaches the application how to generically cope with uncertainty. Unfortunately, without any world knowledge, uncertainty abounds as almost everything becomes (theoretically) possible and maintaining all possibilities produces huge volumes of data. In this paper, we claim that simple and generic knowledge rules are sufficient to drastically reduce uncertainty, hence tame data explosion to a manageable size.
Item Type:Conference or Workshop Item
Additional information:Position paper. Pre-proceedings can be obtained from workshop website (http://ssi.umh.ac.be/iidb) or Jef Wijsen, Institut d'Informatique, Université de Mons-Hainaut, B-7000 Mons, Belgium.
Science and Technology (TNW)
Electrical Engineering, Mathematics and Computer Science (EEMCS)
Research Group:
Link to this item:http://purl.utwente.nl/publications/66509
Official URL:ftp://ftp.umh.ac.be/pub/ftp_ssi/jef/iidbproc.pdf
Export this item as:BibTeX
HTML Citation
Reference Manager


Repository Staff Only: item control page

Metis ID: 238693