• DocumentCode
    105187
  • Title

    Augmenting Image Descriptions Using Structured Prediction Output

  • Author

    Yahong Han ; Xingxing Wei ; Xiaochun Cao ; Yi Yang ; Xiaofang Zhou

  • Author_Institution
    Tianjin Key Lab. of Cognitive Comput. & Applic., Tianjin Univ., Tianjin, China
  • Volume
    16
  • Issue
    6
  • fYear
    2014
  • fDate
    Oct. 2014
  • Firstpage
    1665
  • Lastpage
    1676
  • Abstract
    The need for richer descriptions of images arises in a wide spectrum of applications ranging from image understanding to image retrieval. While the Automatic Image Annotation (AIA) has been extensively studied, image descriptions with the output labels lack sufficient information. This paper proposes to augment image descriptions using structured prediction output. We define a hierarchical tree-structured semantic unit to describe images, from which we can obtain not only the class and subclass one image belongs to, but also the attributes one image has. After defining a new feature map function of structured SVM, we decompose the loss function into every node of the hierarchical tree-structured semantic unit and then predict the tree-structured semantic unit for testing images. In the experiments, we evaluate the performance of the proposed method on two open benchmark datasets and compare with the state-of-the-art methods. Experimental results show the better prediction performance of the proposed method and demonstrate the strength of augmenting image descriptions.
  • Keywords
    image retrieval; support vector machines; AIA; SVM feature map function; automatic image annotation; hierarchical tree-structured semantic unit; image descriptions; image understanding; loss function; structured prediction output; support vector machines; Educational institutions; Feature extraction; Prediction algorithms; Semantics; Support vector machines; Training; Visualization; Image descriptions; image annotation; structured learning; tree-structured semantic unit;
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1520-9210
  • Type

    jour

  • DOI
    10.1109/TMM.2014.2321530
  • Filename
    6810013