Title :
Continuous speech recognition using PLP analysis with multilayer perceptrons
Author :
Morgan, Nigel ; Hermansky, H. ; Bourlard, H. ; Kohn, P. ; Wooters, C.
Author_Institution :
Int. Comput. Sci. Inst., Berkeley, CA, USA
Abstract :
The authors investigate the use of continuous features derived by perceptual linear predictive (PLP) analysis, examine the effect of adding temporal features, and compare it to the previously studied use of multiframe input. Comparisons of the MLP (multilayer perceptron) and conventional Gaussian classifiers are also reported. The speaker-dependent portion of the Resource Management database was used for this test. Additionally, some experiments were performed with a perplexity-2200 speaker-independent recognition task on a subset of the TIMIT database. In each case, the PLP features were used as input to the networks. The experiments show the advantage of continuous PLP features and their first and second temporal derivatives
Keywords :
filtering and prediction theory; neural nets; speech recognition; Gaussian classifiers; PLP analysis; Resource Management database; TIMIT database; continuous speech recognition; multiframe input; multilayer perceptrons; neural networks; perceptual linear predictive analysis; perplexity-2200 speaker-independent recognition; speaker-dependent portion; temporal features; Cepstral analysis; Computer science; Hidden Markov models; Information analysis; Multilayer perceptrons; Resource management; Spatial databases; Speech analysis; Speech recognition; Training data;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location :
Toronto, Ont.
Print_ISBN :
0-7803-0003-3
DOI :
10.1109/ICASSP.1991.150275