مرکز منطقه ای اطلاع رساني علوم و فناوري - Sparse Multilayer Perceptron for Phoneme Recognition

DocumentCode :

1475318

Title :

Sparse Multilayer Perceptron for Phoneme Recognition

Author :

Sivaram, G.S.V.S. ; Hermansky, Hynek

Author_Institution :

ECE Dept., Johns Hopkins Univ., Baltimore, MD, USA

Volume :

Issue :

fYear :

2012

Firstpage :

Lastpage :

Abstract :

This paper introduces the sparse multilayer perceptron (SMLP) which jointly learns a sparse feature representation and nonlinear classifier boundaries to optimally discriminate multiple output classes. SMLP learns the transformation from the inputs to the targets as in multilayer perceptron (MLP) while the outputs of one of the internal hidden layers is forced to be sparse. This is achieved by adding a sparse regularization term to the cross-entropy cost and updating the parameters of the network to minimize the joint cost. On the TIMIT phoneme recognition task, SMLP-based systems trained on individual speech recognition feature streams perform significantly better than the corresponding MLP-based systems. Phoneme error rate of 19.6% is achieved using the combination of SMLP-based systems, a relative improvement of 3.0% over the combination of MLP-based systems.

Keywords :

entropy; error statistics; feature extraction; multilayer perceptrons; speech recognition; SMLP-based systems; TIMIT phoneme recognition task; cross-entropy cost; internal hidden layers; multiple output classes; nonlinear classifier boundary; phoneme error rate; sparse feature representation; sparse multilayer perceptron; sparse regularization term; speech recognition feature streams; Acoustics; Cost function; Hidden Markov models; Multilayer perceptrons; Neurons; Speech recognition; Training; Multilayer perceptron (MLP); phoneme recognition; sparse features;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2011.2129510

Filename :

5734801

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1475318