A Spoken Document Retrieval Application in the Oral History Domain


Share/Save/Bookmark

Huijbregts, Marijn and Ordelman, Roeland and Jong de, Franciska (2005) A Spoken Document Retrieval Application in the Oral History Domain. In: 10th international conference Speech and Computer, SPECOM 2005, October 17-19-2005, Patras, Greece.

[img]
Preview
PDF
40Kb
Abstract:The application of automatic speech recognition in the
broadcast news domain is well studied. Recognition performance
is generally high and accordingly, spoken document retrieval
can successfully be applied in this domain, as demonstrated
by a number of commercial systems. In other domains,
a similar recognition performance is hard to obtain, or even far
out of reach, for example due to lack of suitable training material.
This is a serious impediment for the successful application
of spoken document retrieval techniques for other data then
news. This paper outlines our first steps towards a retrieval system
that can automatically be adapted to new domains. We discuss
our experience with a recently implemented spoken document
retrieval application attached to a web-portal that aims at
the disclosure of a multimedia data collection in the oral history
domain. The paper illustrates that simply deploying an off-theshelf
broadcast news system in this task domain will produce
error rates that are too high to be useful for retrieval tasks. By
applying adaptation techniques on the acoustic level and language
model level, system performance can be improved considerably,
but additional research on unsupervised adaptation
and search interfaces is required to create an adequate search
environment based on speech transcripts.
Item Type:Conference or Workshop Item
Faculty:
Electrical Engineering, Mathematics and Computer Science (EEMCS)
Research Group:
Link to this item:http://purl.utwente.nl/publications/65566
Export this item as:BibTeX
EndNote
HTML Citation
Reference Manager

 

Repository Staff Only: item control page

Metis ID: 227318