• DocumentCode
    3093711
  • Title

    Integrating Visual and Textual Features for Web Image Clustering

  • Author

    Xia, D.S. ; Xiang, Z.Q. ; Zou, Y.X.

  • Author_Institution
    ADSPLAB, Peking Univ., Shenzhen, China
  • fYear
    2015
  • fDate
    20-22 April 2015
  • Firstpage
    116
  • Lastpage
    123
  • Abstract
    With the explosive growth of Web and tremendous development of digital image processing technologies, the applications of Web image have attracted much attention, such as the Web image retrieval. Since the Web images are often with some related text tags, making use of both visual and textual features of Web image will help improving the accuracy of the Web image clustering. Researches show that Web image clustering methods, such as graph partitioning models and hyper graph partitioning models, didn´t make use the relations between texts and image simultaneously. In this paper, we explore to take both visual and textual features into account for Web image clustering by building a graph model and develop a novel iterative clustering method. With K clusters initialized, we calculate the occurrence frequency of each visual/textual feature over the j-th cluster (j = 1, 2,⋯, K), which is used to measure the significance of the feature for the j-th cluster. Then the likelihood of each image, which belongs to the j-th cluster, can be determined accordingly. Furthermore, a mixture model is built for the predicted feature linked to each image and the EM algorithm is adopted to get K component parameters which describe posterior probabilities of all clusters for each image. Then two K-dimensional vectors consisting of component parameters will be used to describe the image and adjust the cluster index of it. Several experiments have been performed with MIR-Flickr25K and IAPR TC-12 Benchmark datasets and the performance of the proposed Web image clustering algorithm is superior to that of the compared algorithm.
  • Keywords
    graph theory; image texture; iterative methods; pattern clustering; probability; text analysis; EM algorithm; IAPR TC-12 benchmark dataset; K-component parameters; K-dimensional vectors; MIR-Flickr25K benchmark dataset; Web image clustering; Web image retrieval; accuracy improvement; cluster index; component parameters; digital image processing technologies; hypergraph partitioning models; image likelihood; iterative clustering method; occurrence frequency; posterior probabilities; text tags; textual feature; textual feature integration; visual feature; visual feature integration; Algorithm design and analysis; Clustering algorithms; Feature extraction; Multimedia communication; Semantics; Time complexity; Visualization; Web image clustering; mixtual model; ranking functions; visual and textual features;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia Big Data (BigMM), 2015 IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4799-8687-3
  • Type

    conf

  • DOI
    10.1109/BigMM.2015.35
  • Filename
    7153864