The Importance of Prior Probabilities for Entry Page Search


Share/Save/Bookmark

Kraaij, W. and Westerveld, T.H.W. and Hiemstra, D. (2002) The Importance of Prior Probabilities for Entry Page Search. In: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR 2002), 11-15 Aug 2002, Tampere, Finland (pp. pp. 27-34).

[img] PDF
Restricted to UT campus only
: Request a copy
135kB
Abstract:An important class of searches on the world-wide-web has the goal to find an entry page (homepage) of an organisation. Entry page search is quite different from Ad Hoc search. Indeed a plain Ad Hoc system performs disappointingly. We explored three non-content features of web pages: page length, number of incoming links and URL form. Especially the URL form proved to be a good predictor. Using URL form priors we found over 70% of all entry pages at rank 1, and up to 89% in the top 10. Non-content features can easily be embedded in a language model framework as a prior probability.
Item Type:Conference or Workshop Item
Additional information:Imported from EWI/DB PMS [db-utwente:inpr:0000003211]
Faculty:
Electrical Engineering, Mathematics and Computer Science (EEMCS)
Research Group:
Link to this item:http://purl.utwente.nl/publications/63507
Official URL:http://doi.acm.org/10.1145/564376.564383
Export this item as:BibTeX
EndNote
HTML Citation
Reference Manager

 

Repository Staff Only: item control page