Query-Based Sampling using Only Snippets
Tigelaar, Almer S. and Hiemstra, Djoerd (2009) Query-Based Sampling using Only Snippets. [Report]
| PDF 563Kb |
| Abstract: | Query-based sampling is a popular approach to model the content of an uncooperative server. It works by sending queries to the server and downloading the returned documents in the search results in full. This sample of documents then represents the server’s content. We present an approach that uses the document snippets as samples instead of downloading entire documents. This yields more stable results at the same amount of bandwidth usage as the full document approach. Additionally, we show that using snippets does not necessarily incur more latency, but can actually save time. |
| Item Type: | Report |
| Copyright: | © 2009 University of Twente, Centre for Telematics and Information Technology |
| Faculty: | Electrical Engineering, Mathematics and Computer Science (EEMCS) |
| Link to this item: | http://purl.utwente.nl/publications/68676 |
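
The sampling loop described in the abstract can be illustrated with a minimal sketch. This is not the report's implementation: `ToySearchServer`, `sample_vocabulary`, the in-memory document list, the five-word snippet window, and the use of character counts as a rough proxy for bandwidth are all assumptions made for illustration. The sketch contrasts learning vocabulary from snippets only with also downloading each returned document in full.

```python
import random


class ToySearchServer:
    """Hypothetical stand-in for an uncooperative search server."""

    def __init__(self, documents, snippet_words=5):
        self.documents = documents
        self.snippet_words = snippet_words

    def search(self, query, limit=10):
        """Return (doc_id, snippet) pairs for documents containing the query term."""
        results = []
        for doc_id, text in enumerate(self.documents):
            words = text.split()
            if query in words:
                i = words.index(query)
                # Cut a small window of words around the first match.
                start = max(0, i - self.snippet_words // 2)
                snippet = " ".join(words[start:start + self.snippet_words])
                results.append((doc_id, snippet))
        return results[:limit]

    def fetch(self, doc_id):
        """Return the full text of one document."""
        return self.documents[doc_id]


def sample_vocabulary(server, seed_term, iterations, use_snippets, rng):
    """Build a vocabulary sample by repeatedly querying with learned terms."""
    vocabulary = {seed_term}
    chars_transferred = 0  # rough proxy for bandwidth (assumption)
    for _ in range(iterations):
        # Pick the next query term from what has been learned so far.
        query = rng.choice(sorted(vocabulary))
        for doc_id, snippet in server.search(query):
            chars_transferred += len(snippet)  # the result page is always received
            text = snippet
            if not use_snippets:
                # Full-document variant: also download the document itself.
                text = server.fetch(doc_id)
                chars_transferred += len(text)
            vocabulary.update(text.split())
    return vocabulary, chars_transferred
```

Calling `sample_vocabulary(server, "fox", 20, True, random.Random(0))` on a small collection returns the learned terms together with the characters transferred; passing `False` instead runs the full-document variant for comparison.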