DocumentCode :
3424219
Title :
A voice activity detection based on the adaptive integration of multiple speech features and a signal decision scheme
Author :
Fujimoto, Masakiyo ; Ishizuka, Kentaro ; Nakatani, Tomohiro
Author_Institution :
NTT Commun. Sci. Labs., NTT Corp., Kyoto
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
4441
Lastpage :
4444
Abstract :
This paper addresses the problem of voice activity detection (VAD) in noisy environments. The VAD method proposed in this paper integrates multiple speech features and a signal decision scheme, namely the speech periodic to aperiodic component ratio and a switching Kalman filter. The integration is carried out by using the weighted sum of likelihoods outputted from each VAD (stream). The stream weight is decided adaptively each short time frame. The evaluation is carried out by using a VAD evaluation framework, CENSREC- 1-C. The evaluation results revealed that the proposed method significantly outperforms the baseline results of CENSREC-1-C as regards VAD accuracy in real environments. In addition, we carried out speech recognition evaluations by using detected speech signals, and confirmed that the proposed method contributes to an improvement in speech recognition accuracy.
Keywords :
Kalman filters; signal detection; speech recognition; adaptive integration; detected speech signals; multiple speech features; signal decision scheme; switching Kalman filter; voice activity detection; Acoustic signal processing; Adaptive signal detection; Concatenated codes; Feature extraction; Kalman filters; Noise robustness; Speech analysis; Speech processing; Speech recognition; Working environment noise; adaptive integration; periodic to aperiodic component ratio; switching Kalman filter; voice activity detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4518641
Filename :
4518641
Link To Document :
بازگشت