• DocumentCode
    9199
  • Title

    The Spike-and-Slab RBM and Extensions to Discrete and Sparse Data Distributions

  • Author

    Courville, Aaron ; Desjardins, Guillaume ; Bergstra, James ; Bengio, Yoshua

  • Author_Institution
    Dept. of Comput. Sci. & Oper. Res., Univ. of Montreal, Montreal, QC, Canada
  • Volume
    36
  • Issue
    9
  • fYear
    2014
  • fDate
    Sept. 2014
  • Firstpage
    1874
  • Lastpage
    1887
  • Abstract
    The spike-and-slab restricted Boltzmann machine (ssRBM) is defined to have both a real-valued “slab” variable and a binary “spike” variable associated with each unit in the hidden layer. The model uses its slab variables to model the conditional covariance of the observation-thought to be important in capturing the statistical properties of natural images. In this paper, we present the canonical ssRBM framework together with some extensions. These extensions highlight the flexibility of the spike-and-slab RBM as a platform for exploring more sophisticated probabilistic models of high dimensional data in general and natural image data in particular. Here, we introduce the subspace-ssRBM focused on the task of learning invariant features. We highlight the behaviour of the ssRBM and its extensions through experiments with the MNIST digit recognition task and the CIFAR-10 object classification task.
  • Keywords
    Boltzmann machines; image classification; natural scenes; statistical analysis; unsupervised learning; CIFAR-10 object classification task; MNIST digit recognition task; binary spike variable; canonical ssRBM framework; conditional covariance; discrete data distributions; invariant feature learning; real-valued slab variable; sophisticated probabilistic models; sparse data distributions; spike-and-slab RBM; spike-and-slab restricted Boltzmann machine; statistical natural image properties; subspace-ssRBM framework; Covariance matrices; Data models; Feature extraction; Slabs; Standards; Training; Vectors; Feature learning; natural image modeling; restricted boltzmann machines; unsupervised learning;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2013.238
  • Filename
    6678502