• DocumentCode
    3176699
  • Title

    Termset-Based Indexing and Query Processing in P2P Search

  • Author

    Zhenhua, Wang ; Derong, Shen ; Ge, Yu

  • Author_Institution
    Coll. of Inf. Sci. & Eng., Northeastern Univ. (NEU), Shenyang, China
  • Volume
    3
  • fYear
    2009
  • fDate
    25-27 Dec. 2009
  • Firstpage
    263
  • Lastpage
    266
  • Abstract
    Multi-term query is a common issue in information retrieval system. In large-scale P2P information retrieval, the method of indexing and query processing based on single-term results in large bandwidth cost. We take into account the correlation among terms and propose a termset-based indexing and query processing method suited for information retrieval in structured P2P overlay. Employing statistics, metadata and query log, we construct a dynamic termset corpus, and the index is built based on termset. When processing query, the peer extracts the termsets from the query terms, and each termset is treated as a key. Several methods are applied to reduce bandwidth consumption. We also present a method of query expansion to be a complement when there are no sufficient results. The experiments show that our method has good performance, and it is suitable for large-scale distributed information retrieval.
  • Keywords
    indexing; information retrieval systems; meta data; peer-to-peer computing; query processing; P2P search; dynamic termset corpus; information retrieval system; large bandwidth cost; large-scale distributed information retrieval; metadata; multiterm query; query expansion method; query log; query processing; statistics; structured P2P overlay; termset based indexing; Bandwidth; Costs; Data mining; Indexing; Information retrieval; Large-scale systems; Probability; Query processing; Statistics; Telecommunication traffic;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science-Technology and Applications, 2009. IFCSTA '09. International Forum on
  • Conference_Location
    Chongqing
  • Print_ISBN
    978-0-7695-3930-0
  • Electronic_ISBN
    978-1-4244-5423-5
  • Type

    conf

  • DOI
    10.1109/IFCSTA.2009.304
  • Filename
    5384831