DocumentCode :
3197166
Title :
Beyond Accuracy: Typicality Ranking for Video Annotation
Author :
Tang, Jinhui ; Hua, Xian-Sheng ; Qi, Guo-Jun ; Gu, Zhiwei ; Wu, Xiuqing
Author_Institution :
China Univ. of Sci. & Technol., Hefei
fYear :
2007
fDate :
2-5 July 2007
Firstpage :
647
Lastpage :
650
Abstract :
In this paper, we address the issue of typicality ranking for video annotation and propose to use a novel criterion, average typicality precision (ATP), to replace the frequently used one, average precision (AP), for evaluating the performance of video annotation algorithms. General annotation methods just care the number of true-positive samples at the top of the ranked list; they actually do not care the order of these samples. We argue that it is more reasonable to rank "typical" true-positive samples higher than non-typical ones, which can be evaluated by our proposed ATP. However, generally the labels of the training data only differentiate true from false; that is to say, typical or non-typical training samples have the same contribution to the learning process. Therefore, the labels of the unlabeled data learned from these training data can not well measure the typicality. In this paper, we relax the labels of the training data to real-valued typicality scores by a pre-processing stage, which is accomplished by three approaches, including density estimation, user feedback and active learning. Then the typicality scores of the training data are propagated to unlabeled data using manifold-ranking. Experiments conducted on the TRECVID data set demonstrate that this typicality ranking scheme is more consistent with human perception than normal accuracy based ranking schemes.
Keywords :
feature extraction; learning (artificial intelligence); video retrieval; video signal processing; active learning process; average typicality precision; density estimation; high-level feature extraction; human perception; manifold ranking; typicality ranking; user feedback; video annotation algorithm; Asia; Costs; Databases; Feature extraction; Feedback; Humans; Information analysis; Labeling; Training data; Video compression;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2007 IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
1-4244-1016-9
Electronic_ISBN :
1-4244-1017-7
Type :
conf
DOI :
10.1109/ICME.2007.4284733
Filename :
4284733
Link To Document :
بازگشت