DocumentCode :
3019279
Title :
Video scene classification based on natural language description
Author :
Zhang, Lei ; Khan, Muhammad Usman Ghani ; Gotoh, Yoshihiko
Author_Institution :
Harbin Eng. Univ., Harbin, China
fYear :
2011
fDate :
6-13 Nov. 2011
Firstpage :
942
Lastpage :
949
Abstract :
This paper addresses the problem of video scene classification based on the small amount of natural language description created for the video stream. The approach incorporates a conventional tf·idf term-document matrix with scene class specific information derived using the maximum a posteriori (MAP) estimates and the chi-square statistic. Further latent semantic analysis (LSA) is applied to find co-occurrence terms between documents. The experiment adopts the k-nearest neighbour (kNN) and the support vector machine (SVM) classifiers to evaluate the effectiveness of scene class information and co-occurrence terms. They achieved 83.86% (kNN) and 98.11% (SVM) when the MAP estimates and the chi-square statistic were combined with the tf·idf term-document matrix, followed by LSA approximation.
Keywords :
approximation theory; document handling; image classification; natural language processing; statistical analysis; support vector machines; video signal processing; LSA approximation; chi-square statistic; k-nearest neighbour; latent semantic analysis; maximum a posteriori estimation; natural language description; support vector machine classifier; tf-idf term-document matrix; video scene classification; video stream; Approximation methods; Humans; Natural languages; Semantics; Streaming media; Support vector machines; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision Workshops (ICCV Workshops), 2011 IEEE International Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4673-0062-9
Type :
conf
DOI :
10.1109/ICCVW.2011.6130353
Filename :
6130353
Link To Document :
بازگشت