The Importance of Prior Probabilities for Entry Page Search
Kraaij, W. and Westerveld, T.H.W. and Hiemstra, D. (2002) The Importance of Prior Probabilities for Entry Page Search. In: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR 2002), 11-15 Aug 2002, Tampere, Finland.
| PDF Restricted to UT campus only: Request a copy 132Kb |
| Abstract: | An important class of searches on the world-wide-web has the goal to find an entry page (homepage) of an organisation. Entry page search is quite different from Ad Hoc search. Indeed a plain Ad Hoc system performs disappointingly. We explored three non-content features of web pages: page length, number of incoming links and URL form. Especially the URL form proved to be a good predictor. Using URL form priors we found over 70% of all entry pages at rank 1, and up to 89% in the top 10. Non-content features can easily be embedded in a language model framework as a prior probability. |
| Item Type: | Conference or Workshop Item |
| Faculty: | Electrical Engineering, Mathematics and Computer Science (EEMCS) |
| Research Group: | |
| Link to this item: | http://purl.utwente.nl/publications/63507 |
| Official URL: | http://doi.acm.org/10.1145/564376.564383 |
| Export this item as: | BibTeX EndNote HTML Citation Reference Manager |
Repository Staff Only: item control page

Show download statistics for this publication
Show download statistics for this publication