• DocumentCode
    3230369
  • Title
    Proof: A DHT-Based Peer-to-Peer Search Engine
  • Author
    Yang, Kai-Hsiang ; Ho, Jan-Ming
  • Author_Institution
    Inst. of Inf. Sci., Acad. Sinica, Taipei
  • fYear
    2006
  • fDate
    18-22 Dec. 2006
  • Firstpage
    702
  • Lastpage
    708
  • Abstract
    In this paper we focus on building a large-scale keyword search service over structured peer-to-peer (P2P) networks. Current state-of-the-art keyword search approaches for structured P2P systems are based on inverted list intersection. However, the biggest challenge in those approaches is that when the indices are distributed over peers, a simple query may cause a large amount of data to be transmitted over the network. We propose a new P2P keyword search scheme, called "Proof", to reduce network traffic for queries. The key idea is to store a content summary for each Web page in the inverted list, so that a query can be processed by transmitting only a small set of candidate results. Our simulation results show that, compared with previous DHT-based P2P systems, Proof can dramatically reduce network traffic and computation time. It provides 100% precision and 90.09% recall of search results, at an acceptable cost of storage overhead, even as the number of peers and documents continues to increase.
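    The key idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say what form the content summary takes, so the sketch assumes it is the set of a page's most frequent terms, and it assumes only those terms are indexed. The names `Network`, `peer_for`, `summarize`, `NUM_PEERS`, and `SUMMARY_SIZE` are hypothetical, and a simple hash-modulo mapping stands in for a real DHT lookup.

```python
import hashlib
from collections import Counter

NUM_PEERS = 8      # hypothetical network size
SUMMARY_SIZE = 5   # hypothetical number of terms kept per page summary


def peer_for(term: str) -> int:
    """Map a term to a peer index; a stand-in for a real DHT lookup."""
    digest = hashlib.sha1(term.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big") % NUM_PEERS


def summarize(text: str) -> frozenset:
    """Assumed summary form: the page's most frequent terms."""
    counts = Counter(text.lower().split())
    return frozenset(term for term, _ in counts.most_common(SUMMARY_SIZE))


class Network:
    def __init__(self):
        # peers[p][term] -> list of (doc_id, summary) postings
        self.peers = [dict() for _ in range(NUM_PEERS)]

    def publish(self, doc_id: str, text: str) -> None:
        """Store the page summary alongside each posting for its terms."""
        summary = summarize(text)
        for term in summary:
            index = self.peers[peer_for(term)]
            index.setdefault(term, []).append((doc_id, summary))

    def search(self, keywords: list) -> list:
        """Answer a multi-keyword query at the peer holding the first keyword.

        The remaining keywords are checked against the stored summaries,
        so only the matching candidates leave that peer, rather than a
        whole inverted list being shipped for intersection elsewhere.
        """
        terms = [k.lower() for k in keywords]
        first, rest = terms[0], set(terms[1:])
        postings = self.peers[peer_for(first)].get(first, [])
        return sorted(doc for doc, summary in postings if rest <= summary)


net = Network()
net.publish("a", "dht peer search engine")
net.publish("b", "dht routing table")
net.publish("c", "peer search ranking")
print(net.search(["peer", "search"]))
```

    Under these assumptions a page whose summary omits a query term is never returned, which is one way a summary-based scheme could trade some recall for lower traffic.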
  • Keywords
    Internet; file organisation; peer-to-peer computing; query processing; search engines; text analysis; DHT-based peer-to-peer search engine; Proof P2P keyword search service; Web page; distributed hash table; inverted list; Buildings; Computational modeling; Computer networks; Keyword search; Large-scale systems; Peer to peer computing; Search engines; Telecommunication traffic; Traffic control; Web pages
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Web Intelligence, 2006. WI 2006. IEEE/WIC/ACM International Conference on
  • Conference_Location
    Hong Kong
  • Print_ISBN
    0-7695-2747-7
  • Type
    conf
  • DOI
    10.1109/WI.2006.137
  • Filename
    4061456