• DocumentCode
    358
  • Title

    Mining User Queries with Markov Chains: Application to Online Image Retrieval

  • Author

    Raftopoulos, Konstantinos A. ; Ntalianis, Klimis S. ; Sourlas, Dionyssios D. ; Kollias, Stefanos D.

  • Author_Institution
    Sch. of Electr. & Comput. Eng., Nat. Tech. Univ. of Athens (NTUA), Athens, Greece
  • Volume
    25
  • Issue
    2
  • fYear
    2013
  • fDate
    Feb. 2013
  • Firstpage
    433
  • Lastpage
    447
  • Abstract
    We propose a novel method for automatic annotation, indexing and annotation-based retrieval of images. The new method, that we call Markovian Semantic Indexing (MSI), is presented in the context of an online image retrieval system. Assuming such a system, the users´ queries are used to construct an Aggregate Markov Chain (AMC) through which the relevance between the keywords seen by the system is defined. The users´ queries are also used to automatically annotate the images. A stochastic distance between images, based on their annotation and the keyword relevance captured in the AMC, is then introduced. Geometric interpretations of the proposed distance are provided and its relation to a clustering in the keyword space is investigated. By means of a new measure of Markovian state similarity, the mean first cross passage time (CPT), optimality properties of the proposed distance are proved. Images are modeled as points in a vector space and their similarity is measured with MSI. The new method is shown to possess certain theoretical advantages and also to achieve better Precision versus Recall results when compared to Latent Semantic Indexing (LSI) and probabilistic Latent Semantic Indexing (pLSI) methods in Annotation-Based Image Retrieval (ABIR) tasks.
  • Keywords
    Markov processes; data mining; image retrieval; probability; ABIR; AMC; CPT; MSI; Markovian semantic indexing; aggregate Markov Chain; annotation based image retrieval; annotation based retrieval; automatic annotation; cross passage time; indexing based retrieval; keyword space; mining user queries; online image retrieval; pLSI; probabilistic latent semantic indexing; stochastic distance; Convergence; Eigenvalues and eigenfunctions; Image retrieval; Indexing; Markov processes; Probabilistic logic; Semantics; Markovian semantic indexing; annotation-based image retrieval; image annotation; query mining;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2011.219
  • Filename
    6051433