DocumentCode :
1475318
Title :
Sparse Multilayer Perceptron for Phoneme Recognition
Author :
Sivaram, G.S.V.S. ; Hermansky, Hynek
Author_Institution :
ECE Dept., Johns Hopkins Univ., Baltimore, MD, USA
Volume :
20
Issue :
1
fYear :
2012
Firstpage :
23
Lastpage :
29
Abstract :
This paper introduces the sparse multilayer perceptron (SMLP) which jointly learns a sparse feature representation and nonlinear classifier boundaries to optimally discriminate multiple output classes. SMLP learns the transformation from the inputs to the targets as in multilayer perceptron (MLP) while the outputs of one of the internal hidden layers is forced to be sparse. This is achieved by adding a sparse regularization term to the cross-entropy cost and updating the parameters of the network to minimize the joint cost. On the TIMIT phoneme recognition task, SMLP-based systems trained on individual speech recognition feature streams perform significantly better than the corresponding MLP-based systems. Phoneme error rate of 19.6% is achieved using the combination of SMLP-based systems, a relative improvement of 3.0% over the combination of MLP-based systems.
Keywords :
entropy; error statistics; feature extraction; multilayer perceptrons; speech recognition; SMLP-based systems; TIMIT phoneme recognition task; cross-entropy cost; internal hidden layers; multiple output classes; nonlinear classifier boundary; phoneme error rate; sparse feature representation; sparse multilayer perceptron; sparse regularization term; speech recognition feature streams; Acoustics; Cost function; Hidden Markov models; Multilayer perceptrons; Neurons; Speech recognition; Training; Multilayer perceptron (MLP); phoneme recognition; sparse features;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2011.2129510
Filename :
5734801
Link To Document :
بازگشت