• DocumentCode
    2049049
  • Title

    Natural scene recognition based on Convolutional Neural Networks and Deep Boltzmannn Machines

  • Author

    Jingyu Gao ; Jinfu Yang ; Jizhao Zhang ; Mingai Li

  • Author_Institution
    Dept. of Control & Eng., Beijing Univ. of Technol., Chaoyang, China
  • fYear
    2015
  • fDate
    2-5 Aug. 2015
  • Firstpage
    2369
  • Lastpage
    2374
  • Abstract
    Scene recognition is a significant topic in computer vision, and Deep Boltzmann Machines (DBM) is a state-of-the-art deep learning model which has been widely applied in object and hand written digit recognition. However, when the DBM is used in scene recognition, it is difficult to handle large images due to its computational complexity. In this paper, we present a deep learning method based on Convolutional Neural Networks (CNN) and DBM for scene image recognition. First, in order to categorize large images, the CNN is utilized to preprocess images for dimensional reduction. Then, regarding the preprocessed images as the input of the visible layer, the DBM model is trained using Contrastive Divergence (CD) algorithm. Finally, after extracting features by the DBM, the softmax regression is employed to perform scene recognition tasks. Since the CNN can reduce effectively image size, the proposed method can improve the computational efficiency and becomes more suitable for large image recognition. Experimental evaluations using SIFT Flow dataset and fifteen-scene dataset demonstrate that the proposed method can obtain promising results.
  • Keywords
    Boltzmann machines; convolution; feature extraction; image recognition; natural scenes; regression analysis; CD algorithm; CNN; DBM; SIFT flow dataset; computational complexity; computational efficiency; computer vision; contrastive divergence; convolutional neural networks; deep Boltzmannn machines; deep learning model; dimensional reduction; feature extraction; image preprocessing; image size; large images categorization; natural scene recognition; scene dataset; scene image recognition; scene recognition tasks; softmax regression; visible layer; Computational modeling; Convolution; Feature extraction; Kernel; Mathematical model; Sensitivity; Training; Convolutional Neural Networks; Deep Boltzmann Machines; Deep Learning; Scene Recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Mechatronics and Automation (ICMA), 2015 IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4799-7097-1
  • Type

    conf

  • DOI
    10.1109/ICMA.2015.7237857
  • Filename
    7237857