مرکز منطقه ای اطلاع رساني علوم و فناوري - Multilayer perceptron with sparse hidden outputs for phoneme recognition

DocumentCode :

2179204

Title :

Multilayer perceptron with sparse hidden outputs for phoneme recognition

Author :

Sivaram, G.S.V.S. ; Hermansky, Hynek

Author_Institution :

Dept. of Electr. & Comput. Eng., Johns Hopkins Univ., Baltimore, MD, USA

fYear :

2011

fDate :

22-27 May 2011

Firstpage :

5336

Lastpage :

5339

Abstract :

This paper introduces the sparse multilayer perceptron (SMLP) which learns the transformation from the inputs to the targets as in multilayer perceptron (MLP) while the outputs of one of the internal hidden layers is forced to be sparse. This is achieved by adding a sparse regularization term to the cross-entropy cost and learning the parameters of the network to minimize the joint cost. On the TIMIT phoneme recognition task, the SMLP based system trained using perceptual linear prediction (PLP) features performs better than the conventional MLP based system. Furthermore, their combination yields a phoneme error rate of 21.2%, a relative improvement of 6.2% over the baseline.

Keywords :

multilayer perceptrons; speech recognition; SMLP; TIMIT phoneme recognition task; sparse hidden outputs; sparse multilayer perceptron; Acoustics; Cost function; Hidden Markov models; Neurons; Speech; Speech processing; Training; Multilayer perceptron; machine learning; phoneme recognition; sparse features;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on

Conference_Location :

Prague

ISSN :

1520-6149

Print_ISBN :

978-1-4577-0538-0

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2011.5947563

Filename :

5947563

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2179204