DocumentCode
678573
Title
A weighted seeds affinity propagation clustering for efficient document mining
Author
Kashyap, Preeti ; Shrivastava, Saurabh K. ; Ujjainiya, Babita
Author_Institution
Dept. of Inf. Technol., SATI, Vidisha, India
fYear
2013
fDate
4-6 July 2013
Firstpage
1
Lastpage
7
Abstract
Clustering is widely used in data mining and learning systems. It is not one specific algorithm, but a general task to be solved which can be achieved by various algorithms that differ significantly in their notion of what constitutes a cluster and how to efficiently find them. However the clustering is not easy task especially for the complex datasets like text mining where the information does not depends only on terms frequency. This paper presents an effective approach for dealing with similar problems. The proposed algorithm is a category dependent weighted seeds affinity clustering algorithm. The advantage of the proposed algorithm is that clusters can be easily modified according to the field of interest of the user. The superiority of the proposed algorithm is also validated by the simulation results comparison using Reuters-21578 dataset. Results shows improvement over k-means, Affinity and Seeds Affinity Algorithm.
Keywords
data mining; document handling; learning systems; pattern clustering; Reuters-21578 dataset; category dependent weighted seeds affinity clustering algorithm; complex datasets; data mining; efficient document mining; k-means algorithm; learning systems; terms frequency; text mining; weighted seeds affinity propagation clustering; Algorithm design and analysis; Availability; Clustering algorithms; Entropy; Indexes; Text mining; Affinity Propagation Clustering; Clustering; K-Means; Text Mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Computing, Communications and Networking Technologies (ICCCNT),2013 Fourth International Conference on
Conference_Location
Tiruchengode
Print_ISBN
978-1-4799-3925-1
Type
conf
DOI
10.1109/ICCCNT.2013.6726723
Filename
6726723
Link To Document