• DocumentCode
    1760435
  • Title

    Multi-Feature Fusion via Hierarchical Regression for Multimedia Analysis

  • Author

    Yi Yang ; Jingkuan Song ; Zi Huang ; Zhigang Ma ; Sebe, Nicu ; Hauptmann, Alexander G.

  • Author_Institution
    Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • Volume
    15
  • Issue
    3
  • fYear
    2013
  • fDate
    41365
  • Firstpage
    572
  • Lastpage
    581
  • Abstract
    Multimedia data are usually represented by multiple features. In this paper, we propose a new algorithm, namely Multi-feature Learning via Hierarchical Regression for multimedia semantics understanding, where two issues are considered. First, labeling large amount of training data is labor-intensive. It is meaningful to effectively leverage unlabeled data to facilitate multimedia semantics understanding. Second, given that multimedia data can be represented by multiple features, it is advantageous to develop an algorithm which combines evidence obtained from different features to infer reliable multimedia semantic concept classifiers. We design a hierarchical regression model to exploit the information derived from each type of feature, which is then collaboratively fused to obtain a multimedia semantic concept classifier. Both label information and data distribution of different features representing multimedia data are considered. The algorithm can be applied to a wide range of multimedia applications and experiments are conducted on video data for video concept annotation and action recognition. Using Trecvid and CareMedia video datasets, the experimental results show that it is beneficial to combine multiple features. The performance of the proposed algorithm is remarkable when only a small amount of labeled training data are available.
  • Keywords
    multimedia systems; regression analysis; video signal processing; CareMedia video datasets; Trecvid video datasets; hierarchical regression; multifeature fusion; multifeature learning; multimedia analysis; multimedia data; multimedia semantics; Algorithm design and analysis; Manifolds; Multimedia communication; Semantics; Streaming media; Training; Training data; Action recognition; multiple feature fusion; semi-supervised learning; video concept annotation;
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1520-9210
  • Type

    jour

  • DOI
    10.1109/TMM.2012.2234731
  • Filename
    6384799