• DocumentCode
    918281
  • Title

    Semisupervised Query Expansion with Minimal Feedback

  • Author

    Okabe, Masayuki ; Yamada, Seiji

  • Author_Institution
    Toyohashi Univ. of Technol., Toyohashi
  • Volume
    19
  • Issue
    11
  • fYear
    2007
  • Firstpage
    1585
  • Lastpage
    1589
  • Abstract
    Query expansion is an information retrieval technique in which new query terms are selected to improve search performance. Although useful terms can be extracted from documents whose relevance is already known, it is difficult to get enough of such feedback from a user in actual use. We propose a query expansion method that performs well even if a user makes practically minimum effort, that is, chooses only a single relevant document. To improve searches in these conditions, we made two refinements to a well-known query expansion method. One uses transductive learning to obtain pseudorelevant documents, thereby increasing the total number of source documents from which expansion terms can be extracted. The other is a modified parameter estimation method that aggregates the predictions of multiple learning trials to sort candidate terms for expansion by importance. Experimental results show that our method outperforms traditional methods and is comparable to a state-of-the-art method.
  • Keywords
    document handling; learning (artificial intelligence); query formulation; information retrieval; information search; machine learning; minimal feedback; modified parameter estimation; pseudorelevant documents; query formulation; query terms; search performance; semisupervised query expansion; source documents; transductive learning; Aggregates; Data mining; Feedback; Frequency; Humans; Information retrieval; Machine learning; Parameter estimation; Statistics; Support vector machines; H.3.3.f Relevance feedback; Information Search and Retrieval; Machine learning; Query formulation;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2007.190646
  • Filename
    4339221