DocumentCode
3640864
Title
Hierarchical tandem feature extraction
Author
Sunil Sivadas;Hynek Hermansky
Author_Institution
Oregon Graduate Institute of Science and Technology, Portland, USA
Volume
1
fYear
2002
fDate
5/1/2002 12:00:00 AM
Abstract
We present a hierarchical architecture for tandem acoustic modeling. In the tandem acoustic modeling paradigm a Multi Layer Perceptron (MLP) is discriminatively trained to estimate phoneme posterior probabilities on a labeled database. The outputs of the MLP after nonlinear transformation and whitening are used as features in a Gaussian Mixture Model (GMM) based recognizer. In this paper we replace the large monolithic MLP with hierarchies of MLP experts. We apply this approach on Speech in Noisy Environments (SPINE 1) evaluation conducted by the Naval Research Laboratory (NRL). We observe a reduction in word error rate of 30% with context-independent models and 5% WER with context-dependent models relative to PLP features.
Keywords
"Artificial neural networks","Books","Speech"
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7402-9
Type
conf
DOI
10.1109/ICASSP.2002.5743841
Filename
5743841
Link To Document