Brute Force Information Retrieval Experiments using MapReduce

Hiemstra, Djoerd and Hauff, Claudia (2012) Brute Force Information Retrieval Experiments using MapReduce. ERCIM News (89). pp. 31-32. ISSN 0926-4981

PDF (Author's version), 236 KB
PDF (Publisher's version), 305 KB
Abstract: MIREX (MapReduce Information Retrieval Experiments) is a software library initially developed by the Database Group of the University of Twente for running large-scale information retrieval experiments on clusters of machines. MIREX has been tested on web crawls of up to half a billion web pages, totalling about 12.5 TB of uncompressed data. MIREX shows that executing test queries by a brute force linear scan of pages is a viable alternative to running them on a search engine's inverted index. MIREX is open source and available to others.
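The brute force approach described in the abstract can be illustrated with a minimal, in-memory sketch. It is not MIREX's actual code or API: the function names (`map_document`, `reduce_query`, `run_experiment`), the whitespace tokenizer, and the toy relative-term-frequency score are all assumptions made for illustration. The map step scans every document once against all test queries, and the reduce step keeps the top-ranked documents per query, so no inverted index is built.

```python
from collections import Counter, defaultdict
import heapq


def tokenize(text):
    # Assumption: lowercase whitespace tokenization, chosen only for brevity.
    return text.lower().split()


def map_document(doc_id, text, queries):
    """Map step: scan one document and emit (query_id, (doc_id, score))."""
    term_counts = Counter(tokenize(text))
    doc_length = sum(term_counts.values())
    if doc_length == 0:
        return
    for query_id, query in queries.items():
        # Toy score (an assumption, not MIREX's ranking function):
        # summed relative frequency of the query terms in the document.
        score = sum(term_counts[term] for term in tokenize(query)) / doc_length
        if score > 0:
            yield query_id, (doc_id, score)


def reduce_query(scored_docs, k=10):
    """Reduce step: keep the k highest-scoring documents for one query."""
    return heapq.nlargest(k, scored_docs, key=lambda pair: pair[1])


def run_experiment(documents, queries, k=10):
    """Simulate the shuffle phase by grouping map output per query."""
    grouped = defaultdict(list)
    for doc_id, text in documents.items():
        for query_id, scored in map_document(doc_id, text, queries):
            grouped[query_id].append(scored)
    return {query_id: reduce_query(docs, k) for query_id, docs in grouped.items()}


if __name__ == "__main__":
    # Hypothetical toy collection and queries.
    documents = {
        "d1": "mapreduce runs on clusters",
        "d2": "inverted index search engine",
        "d3": "brute force scan with mapreduce mapreduce",
    }
    queries = {"q1": "mapreduce", "q2": "inverted index"}
    for query_id, ranking in sorted(run_experiment(documents, queries, k=2).items()):
        print(query_id, ranking)
```

On a real cluster the grouping performed inside `run_experiment` would be handled by the MapReduce framework's shuffle phase. The point of the sketch is only that each document is read once, in sequence, regardless of how many test queries are evaluated.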
Item Type: Article
Faculty: Electrical Engineering, Mathematics and Computer Science (EEMCS)
Research Group:
Link to this item: http://purl.utwente.nl/publications/80117
Official URL: http://ercim-news.ercim.eu/en89/special/brute-force-information-retrieval-experiments-using-mapreduce