Title :
Modeling correlation between multi-modal continuous words for pLSA-based video classification
Author :
Cencen Zhong ; Zhenjiang Miao
Author_Institution :
Inst. of Inf. Sci., Beijing Jiaotong Univ., Beijing, China
Abstract :
Seeing that probabilistic Latent Semantic Analysis (pLSA) deals with discrete quantity only, pLSA with Gaussian Mixtures (GM-pLSA) extends it to continuous feature space by treating continuous feature as continuous word. However, GM-pLSA does not provide a clear way of modeling multimodal features, and also neglects the intrinsic correlation between these continuous words. In this paper, we present a graph regularized multi-modal GM-pLSA (GRMMGM-pLSA) model to incorporate such correlation between multimodal continuous words into the process of model learning. First, multiple GMMs are adopted with each depicting the distribution of continuous words from each modality; and then, a graph regularizer is introduced to capture the word correlation. In the task of video classification, GRMMGM-pLSA that takes both multi-modal visual features of sub-shots and word correlation in terms of temporal consistency between sub-shots into account is exploited to perform feature mapping. Experiments on YouTube videos show the effectiveness of our proposed model.
Keywords :
Gaussian processes; correlation methods; graph theory; image classification; mixture models; probability; video signal processing; GRMMGM-pLSA model; Gaussian mixtures; YouTube videos; correlation modeling; feature mapping; graph regularized multimodal GM-pLSA model; graph regularizer; model learning; multimodal continuous words; multimodal visual features; pLSA-based video classification; probabilistic latent semantic analysis; video classification; word correlation; graph regularizer; multi-modality; probabilistic Latent Semantic Analysis with Gaussian Mixtures; video classification; word correlation;
Conference_Titel :
Image Processing (ICIP), 2014 IEEE International Conference on
Conference_Location :
Paris
DOI :
10.1109/ICIP.2014.7025874