DocumentCode
2002146
Title
Searching very large bodies of data using a transparent peer-to-peer proxy
Author
Taylor, Mike ; Cromme, Marc
Author_Institution
Index Data, London, UK
fYear
2005
fDate
22-26 Aug. 2005
Firstpage
1049
Lastpage
1053
Abstract
While individual data stores are increasingly large, the aggregate size of the Internet dwarfs them all and always will. We consider an approach to searching rich documents across a very large network of individual data stores using a transparent peer-to-peer proxy. This approach is dependent on the use of a standardised search-and-retrieve protocol sufficiently rich to enable semantics to be induced on both its documents and its queries. Candidate protocols include the mature Z39.50 and the more recent SRW/U, of which the latter is considered more "Web-friendly". Networks of the peers underlying this approach to large-repository search and retrieval may take on widely differing topologies, and queries may be routed in widely different ways. Optimal values of tuning parameters may be determined using an evolutionary system in which simulations of different configurations compete against each other. The European collaborative project Alvis is using the approach outlined in this paper to build a semantic peer-to-peer search engine aggregated across multiple subject-specific repositories. Among the problems still to be solved, the matter of how to merge results from multiple peers is the most difficult.
Keywords
Internet; information retrieval; peer-to-peer computing; protocols; search engines; Internet; candidate protocol; data stores; document searching; evolutionary system; retrieve protocol; search engine; search protocol; transparent peer-to-peer proxy; ANSI standards; Access protocols; Aggregates; Bit error rate; ISO standards; Internet; Libraries; Network topology; Peer to peer computing; Standards development;
fLanguage
English
Publisher
ieee
Conference_Titel
Database and Expert Systems Applications, 2005. Proceedings. Sixteenth International Workshop on
ISSN
1529-4188
Print_ISBN
0-7695-2424-9
Type
conf
DOI
10.1109/DEXA.2005.170
Filename
1508413
Link To Document