Title :
A robust speech/non-speech detection algorithm using time and frequency-based features
Author :
Mak, Brian ; Junqua, Jean-Claude ; Reaves, Ben
Author_Institution :
Speech Technol. Lab., Panasonic Technologies Inc., Santa Barbara, CA, USA
Abstract :
The authors address the problem of automatic endpoint detection in normal and adverse conditions. Attention has been given to automatic endpoint detection for both additive noise and noise-induced changes in the talker´s speech production (Lombard reflex). After a comparison of several automatic endpoint detection algorithms in different noisy-Lombard conditions, the authors propose a new algorithm. This algorithm identifies islands of reliability (essentially the portion of speech contained between the first and last vowel) using time- and frequency-based features and then applies a noise adaptive procedure to refine the endpoints. It is shown that this algorithm outperforms the commonly used algorithm developed by Lamel et al. (1981), and several other recently developed methods
Keywords :
acoustic noise; speech recognition; Lombard reflex; additive noise; automatic endpoint detection; frequency-based features; noise adaptive procedure; noise-induced changes; robust speech endpoint detection algorithm; time-based features; 1f noise; Additive noise; Automatic speech recognition; Databases; Detection algorithms; Frequency; Laboratories; Robustness; Signal to noise ratio; Speech enhancement;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
Print_ISBN :
0-7803-0532-9
DOI :
10.1109/ICASSP.1992.225919