• DocumentCode
    678573
  • Title

    A weighted seeds affinity propagation clustering for efficient document mining

  • Author

    Kashyap, Preeti ; Shrivastava, Saurabh K. ; Ujjainiya, Babita

  • Author_Institution
    Dept. of Inf. Technol., SATI, Vidisha, India
  • fYear
    2013
  • fDate
    4-6 July 2013
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    Clustering is widely used in data mining and learning systems. It is not one specific algorithm, but a general task to be solved which can be achieved by various algorithms that differ significantly in their notion of what constitutes a cluster and how to efficiently find them. However the clustering is not easy task especially for the complex datasets like text mining where the information does not depends only on terms frequency. This paper presents an effective approach for dealing with similar problems. The proposed algorithm is a category dependent weighted seeds affinity clustering algorithm. The advantage of the proposed algorithm is that clusters can be easily modified according to the field of interest of the user. The superiority of the proposed algorithm is also validated by the simulation results comparison using Reuters-21578 dataset. Results shows improvement over k-means, Affinity and Seeds Affinity Algorithm.
  • Keywords
    data mining; document handling; learning systems; pattern clustering; Reuters-21578 dataset; category dependent weighted seeds affinity clustering algorithm; complex datasets; data mining; efficient document mining; k-means algorithm; learning systems; terms frequency; text mining; weighted seeds affinity propagation clustering; Algorithm design and analysis; Availability; Clustering algorithms; Entropy; Indexes; Text mining; Affinity Propagation Clustering; Clustering; K-Means; Text Mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computing, Communications and Networking Technologies (ICCCNT),2013 Fourth International Conference on
  • Conference_Location
    Tiruchengode
  • Print_ISBN
    978-1-4799-3925-1
  • Type

    conf

  • DOI
    10.1109/ICCCNT.2013.6726723
  • Filename
    6726723