DocumentCode
2174592
Title
Comparing multilayer perceptron to Deep Belief Network Tandem features for robust ASR
Author
Vinyals, Oriol ; Ravuri, Suman V.
Author_Institution
Int. Comput. Sci. Inst., Berkeley, CA, USA
fYear
2011
fDate
22-27 May 2011
Firstpage
4596
Lastpage
4599
Abstract
In this paper, we extend the work done on integrating multilayer perceptron (MLP) networks with HMM systems via the Tandem approach. In particular, we explore whether the use of Deep Belief Networks (DBN) adds any substantial gain over MLPs on the Aurora2 speech recognition task under mismatched noise conditions. Our findings suggest that DBNs outperform single layer MLPs under the clean condition, but the gains diminish as the noise level is increased. Furthermore, using MFCCs in conjunction with the posteriors from DBNs outperforms merely using single DBNs in low to moderate noise conditions. MFCCs, however, do not help for the high noise settings.
Keywords
belief networks; hidden Markov models; multilayer perceptrons; speech recognition; DBN; HMM system; MLP network; belief network tandem feature; deep belief network; mismatched noise conditions; multilayer perceptron; robust ASR; speech recognition; Accuracy; Mel frequency cepstral coefficient; Noise measurement; Signal to noise ratio; Speech recognition; Training; Automatic Speech Recognition; Deep Belief Network; Multilayer Perceptron;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5947378
Filename
5947378
Link To Document