DocumentCode
2179204
Title
Multilayer perceptron with sparse hidden outputs for phoneme recognition
Author
Sivaram, G.S.V.S. ; Hermansky, Hynek
Author_Institution
Dept. of Electr. & Comput. Eng., Johns Hopkins Univ., Baltimore, MD, USA
fYear
2011
fDate
22-27 May 2011
Firstpage
5336
Lastpage
5339
Abstract
This paper introduces the sparse multilayer perceptron (SMLP) which learns the transformation from the inputs to the targets as in multilayer perceptron (MLP) while the outputs of one of the internal hidden layers is forced to be sparse. This is achieved by adding a sparse regularization term to the cross-entropy cost and learning the parameters of the network to minimize the joint cost. On the TIMIT phoneme recognition task, the SMLP based system trained using perceptual linear prediction (PLP) features performs better than the conventional MLP based system. Furthermore, their combination yields a phoneme error rate of 21.2%, a relative improvement of 6.2% over the baseline.
Keywords
multilayer perceptrons; speech recognition; SMLP; TIMIT phoneme recognition task; sparse hidden outputs; sparse multilayer perceptron; Acoustics; Cost function; Hidden Markov models; Neurons; Speech; Speech processing; Training; Multilayer perceptron; machine learning; phoneme recognition; sparse features;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5947563
Filename
5947563
Link To Document