Natural scene recognition based on Convolutional Neural Networks and Deep Boltzmannn Machines

Author

Jingyu Gao ; Jinfu Yang ; Jizhao Zhang ; Mingai Li

Author_Institution

Dept. of Control & Eng., Beijing Univ. of Technol., Chaoyang, China

fYear

2015

fDate

2-5 Aug. 2015

Firstpage

2369

Lastpage

2374

Abstract

Scene recognition is a significant topic in computer vision, and Deep Boltzmann Machines (DBM) is a state-of-the-art deep learning model which has been widely applied in object and hand written digit recognition. However, when the DBM is used in scene recognition, it is difficult to handle large images due to its computational complexity. In this paper, we present a deep learning method based on Convolutional Neural Networks (CNN) and DBM for scene image recognition. First, in order to categorize large images, the CNN is utilized to preprocess images for dimensional reduction. Then, regarding the preprocessed images as the input of the visible layer, the DBM model is trained using Contrastive Divergence (CD) algorithm. Finally, after extracting features by the DBM, the softmax regression is employed to perform scene recognition tasks. Since the CNN can reduce effectively image size, the proposed method can improve the computational efficiency and becomes more suitable for large image recognition. Experimental evaluations using SIFT Flow dataset and fifteen-scene dataset demonstrate that the proposed method can obtain promising results.

Keywords

Boltzmann machines; convolution; feature extraction; image recognition; natural scenes; regression analysis; CD algorithm; CNN; DBM; SIFT flow dataset; computational complexity; computational efficiency; computer vision; contrastive divergence; convolutional neural networks; deep Boltzmannn machines; deep learning model; dimensional reduction; feature extraction; image preprocessing; image size; large images categorization; natural scene recognition; scene dataset; scene image recognition; scene recognition tasks; softmax regression; visible layer; Computational modeling; Convolution; Feature extraction; Kernel; Mathematical model; Sensitivity; Training; Convolutional Neural Networks; Deep Boltzmann Machines; Deep Learning; Scene Recognition;

fLanguage

English

Publisher

ieee

Conference_Titel

Mechatronics and Automation (ICMA), 2015 IEEE International Conference on

Conference_Location

Beijing

Print_ISBN

978-1-4799-7097-1

Type

conf

DOI

10.1109/ICMA.2015.7237857

Filename

7237857