Using Element Clustering to Increase the Efficiency of XML Schema Matching
Smiljanic, Marko and Keulen van, Maurice and Jonker, Willem (2006) Using Element Clustering to Increase the Efficiency of XML Schema Matching. In: 22nd International Conference on Data Engineering Workshops, ICDEW'06, 3-7 April, 2006, Atlanta, Georgia.
| PDF 173Kb |
| Abstract: | Schema matching attempts to discover semantic mappings between elements of two schemas. Elements are cross compared using various heuristics (e.g., name, data-type, and structure similarity). Seen from a broader perspective, the schema matching problem is a combinatorial problem with an exponential complexity. This makes the naive matching algorithms for large schemas prohibitively inefficient. In this paper we propose a clustering based technique for improving the efficiency of large scale schema matching. The technique inserts clustering as an intermediate step into existing schema matching algorithms. Clustering partitions schemas and reduces the overall matching load, and creates a possibility to trade between the efficiency and effectiveness. The technique can be used in addition to other optimization techniques. In the paper we describe the technique, validate the performance of one implementation of the technique, and open directions for future research. |
| Item Type: | Conference or Workshop Item |
| Copyright: | © 2006 IEEE |
| Faculty: | Electrical Engineering, Mathematics and Computer Science (EEMCS) |
| Research Group: | |
| Link to this item: | http://purl.utwente.nl/publications/58927 |
| Official URL: | http://dx.doi.org/10.1109/ICDEW.2006.159 |
| Export this item as: | BibTeX EndNote HTML Citation Reference Manager |
Repository Staff Only: item control page
Metis ID: 238227

Show download statistics for this publication
Show download statistics for this publication