Adaptive Inference of Fine-grained Data Provenance to Achieve High Accuracy at Lower Storage Costs
Huq, Mohammad Rezwanul and Wombacher, Andreas and Apers, Peter M.G. (2011) Adaptive Inference of Fine-grained Data Provenance to Achieve High Accuracy at Lower Storage Costs. In: 7th IEEE International Conference on E-Science, e-Science 2011, 5-8 December 2011, Stockholm, Sweden.
| PDF Restricted to UT campus only: Request a copy 2960Kb |
| Abstract: | In stream data processing, data arrives continuously and is processed by decision making, process control and e-science applications. To control and monitor these applications, reproducibility of result is a vital requirement. However, it requires massive amount of storage space to store fine-grained provenance data especially for those transformations with overlapping sliding windows. In this paper, we propose techniques which can significantly reduce storage costs and can achieve high accuracy. Our evaluation shows that adaptive inference technique can achieve almost 100% accurate provenance information for a given dataset at lower storage costs than the other techniques. Moreover, we present a guideline about the usage of different provenance collection techniques described in this paper based on the transformation operation and stream characteristics. |
| Item Type: | Conference or Workshop Item |
| Copyright: | © 2011 IEEE |
| Faculty: | Electrical Engineering, Mathematics and Computer Science (EEMCS) |
| Research Group: | |
| Link to this item: | http://purl.utwente.nl/publications/79577 |
| Official URL: | http://dx.doi.org/10.1109/eScience.2011.36 |
| Export this item as: | BibTeX EndNote HTML Citation Reference Manager |
Repository Staff Only: item control page

Show download statistics for this publication
Show download statistics for this publication