DocumentCode :
2182426
Title :
Online detection of vocal Listener Responses with maximum latency constraints
Author :
Neiberg, Daniel ; Truong, Khiet P.
Author_Institution :
Dept. of Speech, Music & Hearing, R. Inst. of Technol. (KTH), Stockholm, Sweden
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
5836
Lastpage :
5839
Abstract :
When human listeners utter Listener Responses (e.g. back-channels or acknowledgments) such as 'yeah' and 'mmhmm', interlocutors commonly continue to speak or resume their speech even before the listener has finished his/her response. This type of speech interactivity results in frequent speech overlap, which is common in human-human conversation. To allow this type of speech interactivity to occur between humans and spoken dialog systems, which will result in more human-like, continuous and smoother human-machine interaction, we propose an on-line classifier which can classify incoming speech as Listener Responses. We show that it is possible to detect vocal Listener Responses using maximum latency thresholds of 100-500 ms, thereby obtaining equal error rates ranging from 34% to 28% by using an energy-based voice activity detector.
Keywords :
speech processing; human-like continuous; human-machine interaction; maximum latency constraints; online detection; speech interactivity; spoken dialog systems; vocal listener responses; Acoustics; Detectors; Feature extraction; Humans; Speech; Speech recognition; Training; Speech processing; speech analysis;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947688
Filename :
5947688