Experiences with IR Top N Optimization in a Main Memory DBMS: Applying 'The Database Approach' in New Domains

Share/Save/Bookmark

Blok, H.E. and Vries, A.P. de and Blanken, H.M. and Apers, P.M.G. (2001) Experiences with IR Top N Optimization in a Main Memory DBMS: Applying 'The Database Approach' in New Domains. In: Advances in Databases : Proceedings of the 18th British National Conference on Databases (BNCOD 2001), 9-11 Jul 2001, Chilton, UK (pp. pp. 126-151).

[img] PDF
Restricted to UT campus only
: Request a copy
287kB
Abstract:Data abstraction and query processing techniques are usually studied in the domain of administrative applications. We present a case-study in the non-standard domain of (multimedia) information retrieval, mainly intended as a feasibility study in favor of the `database approach' to data management.

Top-N queries form a natural query class when dealing with content retrieval. In the IR field, a lot of research has been done on processing top-N queries efficiently. Unfortunately, these results cannot directly be ported to the database environment, because their tuple-oriented nature would seriously limit the freedom of the query optimizer to select appropriate query plans.

By horizontally fragmenting our database containing document statistics, we are able to combine some of the best of the IR and database optimization principles, providing good retrieval quality as well as database `goodies' like flexibility, scalability, efficiency, and generality. Key issues we address in this paper concern the effects of our fragmentation approach on speed and quality of the answers, opportunities for scalability, supported by experimental results.
Item Type:Conference or Workshop Item
Additional information:Imported from EWI/DB PMS [db-utwente:inpr:0000000002]
Faculty:
Electrical Engineering, Mathematics and Computer Science (EEMCS)
Link to this item:http://purl.utwente.nl/publications/63489
Official URL:http://www.springerlink.com/content/1x67w2x2jqpn7y9u/
Export this item as:BibTeX
EndNote
HTML Citation
Reference Manager

 

Repository Staff Only: item control page

Metis ID: 202640