Title :
A Distributed Flocking Approach for Information Stream Clustering Analysis
Author :
Cui, Xiaohui ; Potok, Thomas E.
Author_Institution :
Oak Ridge Nat. Lab., TN
Abstract :
Intelligence analysts are currently overwhelmed with the amount of information streams generated everyday. There is a lack of comprehensive tool that can real-time analyze the information streams. Document clustering analysis plays an important role in improving the accuracy of information retrieval. However, most clustering technologies can only be applied for analyzing the static document collection because they normally require a large amount of computation resource and long time to get accurate result. It is very difficult to cluster a dynamic changed text information streams on an individual computer. Our early research has resulted in a dynamic reactive flock clustering algorithm which can continually refine the clustering result and quickly react to the change of document contents. This character makes the algorithm suitable for cluster analyzing dynamic changed document information, such as text information stream. Because of the decentralized character of this algorithm, a distributed approach is a very natural way to increase the clustering speed of the algorithm. In this paper, we present a distributed multi-agent flocking approach for the text information stream clustering and discuss the decentralized architectures and communication schemes for load balance and status information synchronization in this approach
Keywords :
document handling; information analysis; information retrieval; multi-agent systems; pattern clustering; resource allocation; distributed flocking approach; distributed multiagent flocking approach; document clustering analysis; document contents; dynamic changed text information streams; dynamic reactive flock clustering; information retrieval; information stream clustering analysis; load balancing; static document collection; status information synchronization; text information stream clustering; Algorithm design and analysis; Biological system modeling; Birds; Clustering algorithms; Data mining; Data visualization; Information analysis; Information retrieval; Laboratories; Text analysis;
Conference_Titel :
Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, 2006. SNPD 2006. Seventh ACIS International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
0-7695-2611-X
DOI :
10.1109/SNPD-SAWN.2006.2