Title :
Deep Spatiotemporal Feature Learning with Application to Image Classification
Author :
Karnowski, Thomas P. ; Arel, Itamar ; Rose, Derek
Author_Institution :
Image Sci. & Machine Vision Group, Oak Ridge Nat. Lab., Oak Ridge, TN, USA
Abstract :
Deep machine learning is an emerging framework for dealing with complex high-dimensionality data in a hierarchical fashion which draws some inspiration from biological sources. Despite the notable progress made in the field, there remains a need for an architecture that can represent temporal information with the same ease that spatial information is discovered. In this work, we present new results using a recently introduced deep learning architecture called Deep Spatio-Temporal Inference Network (DeSTIN). DeSTIN is a discriminative deep learning architecture that combines concepts from unsupervised learning for dynamic pattern representation together with Bayesian inference. In DeSTIN the spatiotemporal dependencies that exist within the observations are modeled inherently in an unguided manner. Each node models the inputs by means of clustering and simple dynamics modeling while it constructs a belief state over the distribution of sequences using Bayesian inference. We demonstrate that information from the different layers of this hierarchical system can be extracted and utilized for the purpose of pattern classification. Earlier simulation results indicated that the framework is highly promising, consequently in this work we expand DeSTIN to a popular problem, the MNIST data set of handwritten digits. The system as a preprocessor to a neural network achieves a recognition accuracy of 97.98% on this data set. We further show related experimental results pertaining to automatic cluster adaptation and termination.
Keywords :
image classification; inference mechanisms; learning (artificial intelligence); neural nets; Bayesian inference; belief state; biological sources; complex high-dimensionality data; deep machine learning; deep spatio-temporal inference network; deep spatiotemporal feature learning; discriminative deep learning architecture; dynamic pattern representation; handwritten digits; hierarchical fashion; hierarchical system; image classification; neural network; pattern classification; simple dynamics modeling; spatial information; spatiotemporal dependencies; temporal information; unsupervised learning; Artificial neural networks; Computer architecture; Feature extraction; Machine learning; Measurement; Pixel; Training; biologically-inspired computing; deep learning; online clustering;
Conference_Titel :
Machine Learning and Applications (ICMLA), 2010 Ninth International Conference on
Conference_Location :
Washington, DC
Print_ISBN :
978-1-4244-9211-4
DOI :
10.1109/ICMLA.2010.138