Creating an Information Retrieval test corpus for Dutch
Hiemstra, D. and van Leeuwen, D.A. (2002) Creating an Information Retrieval test corpus for Dutch. In: Computational Linguistics in the Netherlands 2001. Selected Papers of the 12th meeting of Computational Linguistics in the Netherlands (CLIN 2001), 30 Nov 2001, Enschede, The Netherlands.
| PDF 85Kb |
| Abstract: | This paper describes the first large-scale evaluation of information retrieval systems using Dutch documents and queries. We describe in detail the characteristics of the Dutch test data, which is part of the official CLEF multilingual test corpus, and give an overview of the experimental results of companies and research institutions that participated in the first official Dutch CLEF experiments. Judging from these experiments, the handling of languagespecific issues of Dutch, like for instance simple morphology and compound nouns, significantly improves the performance of information retrieval systems in many cases. Careful examination of the test collection shows that it serves as a reliable tool for the evaluation of information retrieval systems in the future. |
| Item Type: | Conference or Workshop Item |
| Faculty: | Electrical Engineering, Mathematics and Computer Science (EEMCS) |
| Research Group: | |
| Link to this item: | http://purl.utwente.nl/publications/66442 |
| Official URL: | http://openurl.ingenta.com/content?genre=article&issn=0921-5034&volume=45&issue=1&spage=133&epage=147 |
| Export this item as: | BibTeX EndNote HTML Citation Reference Manager |
Repository Staff Only: item control page

Show download statistics for this publication
Show download statistics for this publication