Automated speech and audio analysis for semantic access to multimedia


Jong, Franciska de and Ordelman, Roeland and Huijbregts, Marijn (2006) Automated speech and audio analysis for semantic access to multimedia. In: First International Conference on Semantic and Digital Media Technologies, SAMT 2006, 6-8 December 2006, Athens, Greece (pp. pp. 226-240).

open access
Abstract:The deployment and integration of audio processing tools can enhance the semantic annotation of multimedia content, and as a consequence, improve the effectiveness of conceptual access tools. This paper overviews the various ways in which automatic speech and audio analysis can contribute to increased granularity of automatically extracted metadata. A number of techniques will be presented, including the alignment of speech and text resources, large vocabulary speech recognition, key word spotting and speaker classification. The applicability of techniques will be discussed from a media crossing perspective. The added value of the techniques and their potential contribution to the content value chain will be illustrated by the description of two (complementary) demonstrators for browsing broadcast news archives.
Item Type:Conference or Workshop Item
Copyright:© 2006 Springer
Electrical Engineering, Mathematics and Computer Science (EEMCS)
Research Group:
Link to this item:
Official URL:
Export this item as:BibTeX
HTML Citation
Reference Manager


Repository Staff Only: item control page

Metis ID: 237582