DocumentCode
3162268
Title
Improved pre-training of Deep Belief Networks using Sparse Encoding Symmetric Machines
Author
Plahl, Christian ; Sainath, Tara N. ; Ramabhadran, Bhuvana ; Nahamoo, David
Author_Institution
Comput. Sci. Dept., RWTH Aachen Univ., Aachen, Germany
fYear
2012
fDate
25-30 March 2012
Firstpage
4165
Lastpage
4168
Abstract
Restricted Boltzmann Machines (RBM) continue to be a popular methodology to pre-train weights of Deep Belief Networks (DBNs). However, the RBM objective function cannot be maximized directly. Therefore, it is not clear what function to monitor when deciding to stop the training, leading to a challenge in managing the computational costs. The Sparse Encoding Symmetric Machine (SESM) has been suggested as an alternative method for pre-training. By placing a sparseness term on the NN output codebook, SESM allows the objective function to be optimized directly and reliably be monitored as an indicator to stop the training. In this paper, we explore SESM to pre-train DBNs and apply this the first time to speech recognition. First, we provide a detailed analysis comparing the behavior of SESM and RBM. Second, we compare the performance of SESM pre-trained and RBM pre-trained DBNs on TIMIT and a 50 hour English Broadcast News task. Results indicate that pre-trained DBNs using SESM and RBMs achieve comparable performance and outperform randomly initialized DBNs with SESM providing a much easier stopping criterion relative to RBM.
Keywords
belief networks; encoding; speech recognition; DBN; NN output codebook; RBM; SESM; TIMIT; deep belief networks; english broadcast news task; pretraining improvement; restricted Boltzmann machine; sparse encoding symmetric machine; speech recognition; Artificial neural networks; Encoding; Linear programming; Speech; Speech recognition; Training; Deep belief network; neural network feature extraction; pre-training; sparse representation;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location
Kyoto
ISSN
1520-6149
Print_ISBN
978-1-4673-0045-2
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2012.6288836
Filename
6288836
Link To Document