DocumentCode :
1037411
Title :
Scalable Retrieval and Mining With Optimal Peer-to-Peer Configuration
Author :
Chen, Jiann-Jone ; Hu, Chia-Jung ; Su, Chun-Rong
Author_Institution :
Nat. Taiwan Univ. of Sci. & Technol., Taipei
Volume :
10
Issue :
2
fYear :
2008
Firstpage :
209
Lastpage :
220
Abstract :
We proposed to utilize the scalable peer-to-peer network to perform the content-based image retrieval and mining, i.e, P2P-CBIRM. The decentralized unstructured P2P model with certain overheads, i.e., peer clustering and update procedures, is adopted to compromise with the structured one while still reserving flexible routing control when peers join/leave or network fails. The peer CBIRM engine is designed to utilize multi-instance query with multi-feature types to effectively reduce network traffic while maintaining high retrieval accuracy. It helps to enhance the knowledge discovery and image data mining capability. The proposed P2P-CBIRM system provides the scalable retrieval and mining function that the query scope and retrieval accuracy can be adaptively and progressively controlled. To improve the query efficiency (recall-rate/query-scope), it effectively utilizes both: 1) forwarding query message (forward phase) to reduce the query scope and 2) transmitting retrieval results (backward phase) such that activated peers keep filtering high similarity images on the link-path toward the query peer. Experiments show that the query efficiency of the scalable retrieval approach is better than previous methods, i.e., firework query model and breadth-first search. It provides a scalable knowledge discovery platform for efficient image data mining applications. We also proposed to optimally configure the P2P-CBIRM system such that, under a certain number of online users, it would yield the highest recall rate. Simulations demonstrate that, with the optimal configuration, recall rates can be improved to 2.5 to 3 times larger while the network traffic of each peer is reduced to 30% of the original, under the same number of on-line users.
Keywords :
content-based retrieval; data mining; image retrieval; information filtering; pattern clustering; peer-to-peer computing; search engines; telecommunication network routing; telecommunication traffic; content-based image retrieval; decentralized unstructured P2P model; flexible routing control; image data mining capability; knowledge discovery; multiinstance query; network traffic; optimal peer-to-peer configuration; peer CBIRM engine; peer clustering; scalable peer-to-peer network; similarity image filtering; update procedures; Communication system traffic control; Content based retrieval; Control systems; Data mining; Engines; Filtering; Image retrieval; Peer to peer computing; Routing; Traffic control; Content-based information retrieval and mining; image database; multi-instance query; optimal system configuration; peer-to-peer (P2P) networks; scalable retrieval and mining;
fLanguage :
English
Journal_Title :
Multimedia, IEEE Transactions on
Publisher :
ieee
ISSN :
1520-9210
Type :
jour
DOI :
10.1109/TMM.2007.911821
Filename :
4432613
Link To Document :
بازگشت