Visual Alphabets: Video Classification by End Users


Israël, Menno and Broek, Egon L. van den and Putten, Peter van der and Uyl, Marten J. den (2006) Visual Alphabets: Video Classification by End Users. In: Valery A. Petrushin & Latifur Khan (Eds.), Multimedia data mining and knowledge discovery. Springer, London, pp. 185-206. ISBN 9781846284366

[img] PDF
Restricted to UT campus only

Abstract:The work presented here introduces a real-time automatic scene classifier within content-based video retrieval. In our envisioned approach end users like documentalists, not image processing experts, build classifiers interactively, by simply indicating positive examples of a scene. Classification consists of a two-stage procedure. First, small image fragments called patches are classified. Second, frequency vectors of these patch classifications are fed into a second classifier for global scene classification (e.g., city, portraits, or countryside). The first stage classifiers can be seen as a set of highly specialized, learned feature detectors, as an alternative to letting an image processing expert determine features a priori. The end user or domain expert thus builds a visual alphabet that can be used to describe the image in features that are relevant for the task at hand.We present results for experiments on a variety of patch and image classes. The scene classifier approach has been successfully applied to other domains of video content analysis, such as content-based video retrieval in television archives, automated sewer inspection, and porn filtering.
Item Type:Book Section
Copyright:© 2006 Springer
Electrical Engineering, Mathematics and Computer Science (EEMCS)
Research Group:
Link to this item:
Official URL:
Export this item as:BibTeX
HTML Citation
Reference Manager


Repository Staff Only: item control page

Metis ID: 243090