Online Detection Of Vocal Listener Responses With Maximum Latency Constraints
Neiberg, Daniel and Truong, Khiet P. (2011) Online Detection Of Vocal Listener Responses With Maximum Latency Constraints. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011, 22-27 May 2011, Prague, Czech Republic.
| PDF Restricted to UT campus only: Request a copy 123Kb |
| Abstract: | When human listeners utter Listener Responses (e.g. back-channels or acknowledgments) such as 'yeah' and 'mmhmm', interlocutors commonly continue to speak or resume their speech even before the listener has finished his/her response. This type of speech interactivity results in frequent speech overlap which is common in human-human conversation. To allow for this type of speech interactivity to occur between humans and spoken dialog systems, which will result in more human-like continuous and smoother human-machine interaction, we propose an on-line classifier which can classify incoming speech as Listener Responses. We show that it is possible to detect vocal Listener Responses using maximum latency thresholds of 100-500 ms, thereby obtaining equal error rates ranging from 34% to 28% by using an energy based voice activity detector. |
| Item Type: | Conference or Workshop Item |
| Copyright: | © 2011 IEEE |
| Faculty: | Electrical Engineering, Mathematics and Computer Science (EEMCS) |
| Research Group: | |
| Link to this item: | http://purl.utwente.nl/publications/77316 |
| Export this item as: | BibTeX EndNote HTML Citation Reference Manager |
Repository Staff Only: item control page

Show download statistics for this publication
Show download statistics for this publication