• DocumentCode
    3197166
  • Title

    Beyond Accuracy: Typicality Ranking for Video Annotation

  • Author

    Tang, Jinhui ; Hua, Xian-Sheng ; Qi, Guo-Jun ; Gu, Zhiwei ; Wu, Xiuqing

  • Author_Institution
    China Univ. of Sci. & Technol., Hefei
  • fYear
    2007
  • fDate
    2-5 July 2007
  • Firstpage
    647
  • Lastpage
    650
  • Abstract
    In this paper, we address the issue of typicality ranking for video annotation and propose a novel criterion, average typicality precision (ATP), to replace the frequently used average precision (AP) for evaluating the performance of video annotation algorithms. General annotation methods care only about the number of true-positive samples at the top of the ranked list; they do not care about the order of these samples. We argue that it is more reasonable to rank "typical" true-positive samples higher than non-typical ones, which can be evaluated by our proposed ATP. However, the labels of the training data generally only differentiate true from false; that is to say, typical and non-typical training samples make the same contribution to the learning process. Therefore, the labels of the unlabeled data learned from these training data cannot measure typicality well. In this paper, we relax the labels of the training data to real-valued typicality scores in a pre-processing stage, which is accomplished by three approaches: density estimation, user feedback, and active learning. The typicality scores of the training data are then propagated to the unlabeled data using manifold ranking. Experiments conducted on the TRECVID data set demonstrate that this typicality ranking scheme is more consistent with human perception than conventional accuracy-based ranking schemes.
  • Keywords
    feature extraction; learning (artificial intelligence); video retrieval; video signal processing; active learning process; average typicality precision; density estimation; high-level feature extraction; human perception; manifold ranking; typicality ranking; user feedback; video annotation algorithm; Asia; Costs; Databases; Feature extraction; Feedback; Humans; Information analysis; Labeling; Training data; Video compression;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Multimedia and Expo, 2007 IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    1-4244-1016-9
  • Electronic_ISBN
    1-4244-1017-7
  • Type

    conf

  • DOI
    10.1109/ICME.2007.4284733
  • Filename
    4284733
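
Note on the abstract's claim that AP ignores the order of true-positive samples: the minimal sketch below computes standard average precision for two rankings that differ only in which true positive comes first. The shot ids, relevance flags, and typicality scores are hypothetical values made up for illustration, and the code does not implement the paper's ATP, whose definition is not given in this record.

```python
def average_precision(relevance):
    """Standard AP of a ranked list.

    relevance[i] is 1 if the item at rank i+1 is a true positive, else 0.
    Normalizes by the number of true positives present in the list.
    """
    hits = 0
    precision_sum = 0.0
    for rank, rel in enumerate(relevance, start=1):
        if rel:
            hits += 1
            precision_sum += hits / rank  # precision at this true positive
    return precision_sum / hits if hits else 0.0


# Hypothetical ranked lists: (shot id, is true positive, typicality in [0, 1]).
# Ranking A puts the most typical positive first.
ranking_a = [("s1", 1, 0.9), ("s2", 1, 0.2), ("s3", 0, 0.0), ("s4", 1, 0.6)]
# Ranking B swaps the typical and non-typical positives at ranks 1 and 2.
ranking_b = [("s2", 1, 0.2), ("s1", 1, 0.9), ("s3", 0, 0.0), ("s4", 1, 0.6)]

# AP only sees the binary relevance flags, never the typicality scores.
ap_a = average_precision([rel for _, rel, _ in ranking_a])
ap_b = average_precision([rel for _, rel, _ in ranking_b])

print(ap_a, ap_b)
```

Both rankings receive the same AP because the true positives occupy the same rank positions; a criterion that also rewards placing typical samples first would have to use the real-valued scores.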