Title :
Robust Endpoint Detection for Speech Recognition Based on Discriminative Feature Extraction
Author :
Yamamoto, Koichi ; Jabloun, Firas ; Reinhard, Klaus ; Kawamura, Akinori
Author_Institution :
Multimedia Lab., Toshiba Corp., Tokyo
Abstract :
Accurate endpoint detection is important for improving the speech recognition capability. This paper proposes a novel endpoint detection method which combines energy-based and likelihood ratio-based voice activity detection (VAD) criteria, where the likelihood ratio is calculated with speech/non-speech Gaussian mixture models (GMMs). Moreover, the proposed method introduces the discriminative feature extraction technique (DFE) in order to improve the speech/non-speech classification. The DFE is used in the training of parameters required for calculating the likelihood ratio. Experimental results have shown that the proposed endpointer achieves good performance compared to an energy-based endpointer in terms of start-of-speech (SOS) and end-of-speech (EOS) detections. Due to the improvement of the endpointer, the performance of automatic speech recognition (ASR) has also been improved
Keywords :
Gaussian processes; feature extraction; speech recognition; automatic speech recognition; discriminative feature extraction; end-of-speech detections; nonspeech Gaussian mixture models; nonspeech classification; robust endpoint detection; start-of-speech detections; voice activity detection; Automatic speech recognition; Feature extraction; Laboratories; Linear discriminant analysis; Noise level; Noise robustness; Research and development; Speech recognition; Tellurium; Working environment noise;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
Print_ISBN :
1-4244-0469-X
DOI :
10.1109/ICASSP.2006.1660143