Title :
Scalable Semantic Search with Hybrid Concept Index over Structure Peer-to-Peer Network
Author :
Ma, Wenhui ; Wang, Gang ; Liu, Jing
Author_Institution :
Coll. of Inf. Technol. & Sci., Nankai Univ., Tianjin
Abstract :
The primary challenge in developing a peer-to-peer (P2P) file sharing system is implementing an efficient keyword search mechanism. Current keyword search approaches for structured P2P networks are built on a distributed inverted index keyed by keyword. However, when executing multiple-attribute queries, they suffer from unscalable bandwidth consumption. Moreover, these approaches support only literal word matching and do not take the meaning of words into account. In this paper, we propose an efficient keyword search mechanism over structured P2P networks. Peers use a shared ontology to describe the content of a document and the subject of a query. A distributed hybrid concept index is constructed, which efficiently supports query routing and matching and avoids intersecting inverted lists among peers, the cause of unscalable network bandwidth consumption. Based on the semantic similarity between the subjects of queries and the contents of documents, peers obtain results that match their queries semantically rather than by literal word matching. Simulation experiments show that keyword search with the proposed approach incurs much lower bandwidth cost and achieves much higher retrieval performance than search based on a standard keyword inverted index.
Keywords :
peer-to-peer computing; query processing; P2P file sharing system; distributed hybrid concept index; distributed inverted index; keyword search mechanism; query matching; query routing; scalable semantic search; semantic similarity; structure peer-to-peer network; Bandwidth; Costs; Educational institutions; Information resources; Information technology; Keyword search; Natural languages; Ontologies; Peer to peer computing; Query processing
Conference_Title :
Grid and Cooperative Computing, 2007. GCC 2007. Sixth International Conference on
Conference_Location :
Los Alamitos, CA
Print_ISBN :
0-7695-2871-6
DOI :
10.1109/GCC.2007.110