Automated speech and audio analysis for semantic access to multimedia
de Jong, Franciska and Ordelman, Roeland and Huijbregts, Marijn (2006) Automated speech and audio analysis for semantic access to multimedia. In: First International Conference on Semantic and Digital Media Technologies, SAMT 2006, 6-8 December 2006, Athens, Greece.
| Abstract: | The deployment and integration of audio processing tools can enhance the semantic annotation of multimedia content and, as a consequence, improve the effectiveness of conceptual access tools. This paper gives an overview of the various ways in which automatic speech and audio analysis can contribute to increased granularity of automatically extracted metadata. A number of techniques are presented, including the alignment of speech and text resources, large-vocabulary speech recognition, keyword spotting, and speaker classification. The applicability of the techniques is discussed from a media-crossing perspective. The added value of the techniques and their potential contribution to the content value chain are illustrated by the description of two complementary demonstrators for browsing broadcast news archives. |
| Item Type: | Conference or Workshop Item |
| Copyright: | © 2006 Springer |
| Faculty: | Electrical Engineering, Mathematics and Computer Science (EEMCS) |
| Link to this item: | http://purl.utwente.nl/publications/66586 |
| Official URL: | http://dx.doi.org/10.1007/11930334_18 |
Metis ID: 237582
