DocumentCode
3093711
Title
Integrating Visual and Textual Features for Web Image Clustering
Author
Xia, D.S. ; Xiang, Z.Q. ; Zou, Y.X.
Author_Institution
ADSPLAB, Peking Univ., Shenzhen, China
fYear
2015
fDate
20-22 April 2015
Firstpage
116
Lastpage
123
Abstract
With the explosive growth of Web and tremendous development of digital image processing technologies, the applications of Web image have attracted much attention, such as the Web image retrieval. Since the Web images are often with some related text tags, making use of both visual and textual features of Web image will help improving the accuracy of the Web image clustering. Researches show that Web image clustering methods, such as graph partitioning models and hyper graph partitioning models, didn´t make use the relations between texts and image simultaneously. In this paper, we explore to take both visual and textual features into account for Web image clustering by building a graph model and develop a novel iterative clustering method. With K clusters initialized, we calculate the occurrence frequency of each visual/textual feature over the j-th cluster (j = 1, 2,⋯, K), which is used to measure the significance of the feature for the j-th cluster. Then the likelihood of each image, which belongs to the j-th cluster, can be determined accordingly. Furthermore, a mixture model is built for the predicted feature linked to each image and the EM algorithm is adopted to get K component parameters which describe posterior probabilities of all clusters for each image. Then two K-dimensional vectors consisting of component parameters will be used to describe the image and adjust the cluster index of it. Several experiments have been performed with MIR-Flickr25K and IAPR TC-12 Benchmark datasets and the performance of the proposed Web image clustering algorithm is superior to that of the compared algorithm.
Keywords
graph theory; image texture; iterative methods; pattern clustering; probability; text analysis; EM algorithm; IAPR TC-12 benchmark dataset; K-component parameters; K-dimensional vectors; MIR-Flickr25K benchmark dataset; Web image clustering; Web image retrieval; accuracy improvement; cluster index; component parameters; digital image processing technologies; hypergraph partitioning models; image likelihood; iterative clustering method; occurrence frequency; posterior probabilities; text tags; textual feature; textual feature integration; visual feature; visual feature integration; Algorithm design and analysis; Clustering algorithms; Feature extraction; Multimedia communication; Semantics; Time complexity; Visualization; Web image clustering; mixtual model; ranking functions; visual and textual features;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia Big Data (BigMM), 2015 IEEE International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4799-8687-3
Type
conf
DOI
10.1109/BigMM.2015.35
Filename
7153864
Link To Document