Visual Alphabets: Video Classification by End Users



Israël, Menno and van den Broek, Egon L. and van der Putten, Peter and den Uyl, Marten J. (2006) Visual Alphabets: Video Classification by End Users. In: Multimedia Data Mining and Knowledge Discovery. Springer, London, pp. 185-206. ISBN 9781846284366

Full text: PDF (1354Kb), restricted to UT campus only
Abstract: The work presented here introduces a real-time automatic scene classifier within content-based video retrieval. In our envisioned approach, end users like documentalists, not image processing experts, build classifiers interactively, by simply indicating positive examples of a scene. Classification consists of a two-stage procedure. First, small image fragments called patches are classified. Second, frequency vectors of these patch classifications are fed into a second classifier for global scene classification (e.g., city, portraits, or countryside). The first stage classifiers can be seen as a set of highly specialized, learned feature detectors, as an alternative to letting an image processing expert determine features a priori. The end user or domain expert thus builds a visual alphabet that can be used to describe the image in features that are relevant for the task at hand. We present results for experiments on a variety of patch and image classes. The scene classifier approach has been successfully applied to other domains of video content analysis, such as content-based video retrieval in television archives, automated sewer inspection, and porn filtering.
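The two-stage procedure described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which classifiers or patch features are used, so a simple nearest-centroid classifier stands in for both stages, patches are represented by made-up two-dimensional feature vectors, and the patch class names ("sky", "building", "grass") and all function names are hypothetical. Only the overall structure (classify patches, build a frequency vector, classify the vector) follows the abstract.

```python
from collections import Counter
from math import dist  # Euclidean distance, Python 3.8+


def fit_centroids(examples):
    """Average the vectors of each label. examples: list of (vector, label)."""
    grouped = {}
    for vector, label in examples:
        grouped.setdefault(label, []).append(vector)
    return {
        label: tuple(sum(col) / len(vectors) for col in zip(*vectors))
        for label, vectors in grouped.items()
    }


def nearest_label(vector, centroids):
    """Stand-in classifier (assumption): pick the label of the closest centroid."""
    return min(centroids, key=lambda label: dist(vector, centroids[label]))


def frequency_vector(patches, patch_centroids, alphabet):
    """Stage 1: classify every patch, then count how often each patch class occurs."""
    counts = Counter(nearest_label(p, patch_centroids) for p in patches)
    return tuple(counts[name] / len(patches) for name in alphabet)


def classify_scene(patches, patch_centroids, alphabet, scene_centroids):
    """Stage 2: classify the whole image from its patch-class frequencies."""
    freq = frequency_vector(patches, patch_centroids, alphabet)
    return nearest_label(freq, scene_centroids)


# Hypothetical "visual alphabet": patch classes learned from positive examples.
alphabet = ("sky", "building", "grass")
patch_centroids = fit_centroids([
    ((0.9, 0.1), "sky"), ((0.8, 0.2), "sky"),
    ((0.5, 0.9), "building"), ((0.4, 0.8), "building"),
    ((0.2, 0.1), "grass"), ((0.3, 0.2), "grass"),
])

# Toy training images, each given as a list of patch feature vectors.
city_image = [(0.5, 0.9), (0.4, 0.8), (0.45, 0.85), (0.9, 0.1)]
countryside_image = [(0.2, 0.1), (0.3, 0.2), (0.25, 0.15), (0.8, 0.2)]
scene_centroids = fit_centroids([
    (frequency_vector(city_image, patch_centroids, alphabet), "city"),
    (frequency_vector(countryside_image, patch_centroids, alphabet), "countryside"),
])

# An unseen image: two building-like patches, one sky-like, one grass-like.
new_image = [(0.42, 0.88), (0.5, 0.8), (0.85, 0.1), (0.3, 0.1)]
new_freq = frequency_vector(new_image, patch_centroids, alphabet)
new_scene = classify_scene(new_image, patch_centroids, alphabet, scene_centroids)
```

In this toy setup the second stage never sees pixel data, only the relative frequency of each patch class, which is what lets an end user define task-relevant features by labelling example patches.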
Item Type: Book Section
Copyright: © 2006 Springer
Faculty: Electrical Engineering, Mathematics and Computer Science (EEMCS)
Research Group:
Link to this item: http://purl.utwente.nl/publications/58737
Official URL: http://dx.doi.org/10.1007/978-1-84628-799-2_10

Metis ID: 243090