Title :
A multivariate speech activity detector based on the syllable rate
Author :
Smith, David C. ; Townsend, Jefrey ; Nelson, Douglas J. ; Richman, Dan
Author_Institution :
Dept. of Defense, Fort Meade, MD, USA
Abstract :
Computationally efficient speech extraction algorithms have significant potential economic benefit, by automating an extremely tedious manual process. Previously, algorithms which discriminate between speech and one specific other signal type have been developed, and often fail when the specific non-speech signal is replaced by a different signal type. Moreover, several such signal specific discriminators have been combined in order to tackle the general speech vs. non-speech discrimination problem, with predictable negative results. When the number of discriminating features is large, compression methods such as principal components have been applied to reduce dimension, even though information may be lost in the process. In this paper, graphical tools are applied to determine a set of features which produce excellent speech vs. nonspeech clustering. This cluster structure provides the basis for a general speech vs. non-speech discriminator, which significantly outperforms the TALKATIVE speech extraction algorithm
Keywords :
feature extraction; pattern clustering; speech recognition; clustering; computationally efficient speech extraction algorithms; graphical tools; multivariate speech activity detector; nonspeech signal; speech/nonspeech discrimination problem; syllable rate; Acoustic signal detection; Autocorrelation; Clustering algorithms; Data mining; Detectors; Economic forecasting; Modems; Signal to noise ratio; Speech processing; US Department of Defense;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
Print_ISBN :
0-7803-5041-3
DOI :
10.1109/ICASSP.1999.758065