• Title of article

    Content-Based Retrieval using Heuristic Search

  • Author/Authors

    Papadias، Dimitris نويسنده , , Mantzourogiannis، Marios نويسنده , , Kalnis، Panos نويسنده , , Mamoulis، Nikos نويسنده , , Ahmad، Ishfaq نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 1999
  • Pages
    -167
  • From page
    168
  • To page
    0
  • Abstract
    In this paper we examine the question of query parsing for World Wide Web queries and present a novel method for phrase recognition and expansion. Given a training corpus of approximately 16 million Web queries and a handwritten context-free grammar, the EM algorithm is used to estimate the parameters of a probabilistic context-free grammar (PCFG) with a system developed by Carroll [5]. We use the PCFG to compute the most probable parse for a user query, reflecting linguistic structure and word usage of the domain being parsed. The optimal syntactic parse for a user query thus obtained is employed for phrase recognition and expansion. Phrase recognition is used to increase retrieval precision; phrase expansion is applied to make the best use possible of very short Web queries.
  • Keywords
    MMIR , image indexing/retrieval , content-based indexing/retrieval
  • Journal title
    SIGIR FORUM
  • Serial Year
    1999
  • Journal title
    SIGIR FORUM
  • Record number

    16670