DocumentCode
3661192
Title
Stochastic least squares learning for deep architectures
Author
Girish Kumar;Jian Min Sim;Eng Yeow Cheu;Xiaoli Li
Author_Institution
NUS High School of Mathematics and Science, Singapore 129957
fYear
2015
fDate
7/1/2015 12:00:00 AM
Firstpage
1
Lastpage
7
Abstract
In this paper, we present a novel way of pre-training deep architectures by using the stochastic least squares autoencoder (SLSA). The SLSA is based on the combination of stochastic least squares estimation and logistic sampling. The usefulness of the stochastic least squares approach coupled with the numerical trick of constraining the logistic sampling process is highlighted in this paper. This approach was tested and benchmarked against other methods including Neural Nets (NN), Deep Belief Nets (DBN), and Stacked Denoising Autoencoder (SDAE) using the MNIST dataset. In addition, the SLSA architecture was also tested against established methods such as the Support Vector Machine (SVM), and the Naive Bayes Classifier (NB) on the Reuters-21578 and MNIST datasets. The experiments show the promise of SLSA as a pre-training step, in which stacked of SLSA yielded the lowest classification error and the highest F-measure scores on the MNIST and Reuters-21578 datasets respectively. Hence, this paper establishes the value of pre-training deep neural network, by using the SLSA.
Publisher
ieee
Conference_Titel
Neural Networks (IJCNN), 2015 International Joint Conference on
Electronic_ISBN
2161-4407
Type
conf
DOI
10.1109/IJCNN.2015.7280502
Filename
7280502
Link To Document