DocumentCode :
1765107
Title :
The Potential Energy of an Autoencoder
Author :
Kamyshanska, Hanna ; Memisevic, Roland
Author_Institution :
Dept. of Comput. Neurosci., Frankfurt Inst. for Adv. Studies, Frankfurt, Germany
Volume :
37
Issue :
6
fYear :
2015
fDate :
June 1 2015
Firstpage :
1261
Lastpage :
1273
Abstract :
Autoencoders are popular feature learning models, that are conceptually simple, easy to train and allow for efficient inference. Recent work has shown how certain autoencoders can be associated with an energy landscape, akin to negative log-probability in a probabilistic model, which measures how well the autoencoder can represent regions in the input space. The energy landscape has been commonly inferred heuristically, by using a training criterion that relates the autoencoder to a probabilistic model such as a Restricted Boltzmann Machine (RBM). In this paper we show how most common autoencoders are naturally associated with an energy function, independent of the training procedure, and that the energy landscape can be inferred analytically by integrating the reconstruction function of the autoencoder. For autoencoders with sigmoid hidden units, the energy function is identical to the free energy of an RBM, which helps shed light onto the relationship between these two types of model. We also show that the autoencoder energy function allows us to explain common regularization procedures, such as contractive training, from the perspective of dynamical systems. As a practical application of the energy function, a generative classifier based on class-specific autoencoders is presented.
Keywords :
Boltzmann machines; learning (artificial intelligence); pattern classification; probability; RBM; class-specific autoencoders; contractive training; energy function; energy landscape; feature learning models; generative classifier; negative log-probability; potential energy; probabilistic model; reconstruction function; regularization procedures; restricted Boltzmann machine; sigmoid hidden units; training criterion; Analytical models; Data models; Potential energy; Principal component analysis; Probabilistic logic; Training; Vectors; Autoencoders; generative classification; representation learning; unsupervised learning;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2014.2362140
Filename :
6918504
Link To Document :
بازگشت