Title :
Multi-view Clustering of Visual Words Using Canonical Correlation Analysis for Human Action Recognition
Author :
Saghafi, Behrouz ; Rajan, Deepu
Author_Institution :
Centre For Multimedia & Network Technol., Nanyang Technol. Univ., Singapore, Singapore
Abstract :
In this paper we propose a novel approach for introducing semantic relations into the bag-of-words framework for recognizing human actions. We represent visual words in two different views: the original features and the document co-occurrence representation. The latter view conveys semantic relations but is large, sparse and noisy. We use canonical correlation analysis between the two views to find a subspace in which the words are more semantically distributed. We apply k-means clustering in the computed space to find semantically meaningful clusters and use them as the semantic visual vocabulary. Incorporating the semantic visual vocabulary the features are quantized to form more discriminative histograms. Eventually the histograms are classified using an SVM classifier. We have tested our approach on KTH action dataset and achieved promising results.
Keywords :
correlation methods; gesture recognition; information retrieval; natural language processing; pattern clustering; support vector machines; vocabulary; KTH action dataset; SVM classifier; bag of word; canonical correlation analysis; discriminative histogram; document cooccurrence representation; human action recognition; k-means clustering; semantic visual vocabulary; Accuracy; Correlation; Feature extraction; Histograms; Semantics; Visualization; Vocabulary; Bag-of-words; Canonical Correlation Analysis; Clustering; Human Action Recognition; Multi-view;
Conference_Titel :
Machine Learning and Applications (ICMLA), 2010 Ninth International Conference on
Conference_Location :
Washington, DC
Print_ISBN :
978-1-4244-9211-4
DOI :
10.1109/ICMLA.2010.102