Query-Based Sampling using Snippets
Tigelaar, Almer S. and Hiemstra, D. (2010) Query-Based Sampling using Snippets. In: Eighth Workshop on Large-Scale Distributed Systems for Information Retrieval, 23 Jul 2010, Geneva, Switzerland.
| PDF 459Kb |
| Abstract: | Query-based sampling is a commonly used approach to model the content of servers. Conventionally, queries are sent to a server and the documents in the search results returned are downloaded in full as representation of the server’s content. We present an approach that uses the document snippets in the search results as samples instead of downloading the entire documents. We show this yields equal or better modeling performance for the same bandwidth consumption depending on collection characteristics, like document length distribution and homogeneity. Query-based sampling using snippets is a useful approach for real-world systems, since it requires no extra operations beyond exchanging queries and search results. |
| Item Type: | Conference or Workshop Item |
| Faculty: | Electrical Engineering, Mathematics and Computer Science (EEMCS) |
| Research Group: | |
| Link to this item: | http://purl.utwente.nl/publications/72429 |
| Official URL: | http://www.lsdsir.org/wp-content/uploads/2010/05/LSDSIR2010.pdf |
| Export this item as: | BibTeX EndNote HTML Citation Reference Manager |
Repository Staff Only: item control page

Show download statistics for this publication
Show download statistics for this publication