DocumentCode
1894414
Title
Continuous speech recognition using PLP analysis with multilayer perceptrons
Author
Morgan, Nigel ; Hermansky, H. ; Bourlard, H. ; Kohn, P. ; Wooters, C.
Author_Institution
Int. Comput. Sci. Inst., Berkeley, CA, USA
fYear
1991
fDate
14-17 Apr 1991
Firstpage
49
Abstract
The authors investigate the use of continuous features derived by perceptual linear predictive (PLP) analysis, examine the effect of adding temporal features, and compare it to the previously studied use of multiframe input. Comparisons of the MLP (multilayer perceptron) and conventional Gaussian classifiers are also reported. The speaker-dependent portion of the Resource Management database was used for this test. Additionally, some experiments were performed with a perplexity-2200 speaker-independent recognition task on a subset of the TIMIT database. In each case, the PLP features were used as input to the networks. The experiments show the advantage of continuous PLP features and their first and second temporal derivatives
Keywords
filtering and prediction theory; neural nets; speech recognition; Gaussian classifiers; PLP analysis; Resource Management database; TIMIT database; continuous speech recognition; multiframe input; multilayer perceptrons; neural networks; perceptual linear predictive analysis; perplexity-2200 speaker-independent recognition; speaker-dependent portion; temporal features; Cepstral analysis; Computer science; Hidden Markov models; Information analysis; Multilayer perceptrons; Resource management; Spatial databases; Speech analysis; Speech recognition; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location
Toronto, Ont.
ISSN
1520-6149
Print_ISBN
0-7803-0003-3
Type
conf
DOI
10.1109/ICASSP.1991.150275
Filename
150275
Link To Document