• DocumentCode
    3461695
  • Title

    Document Clustering Description Based on Combination Strategy

  • Author

    Zhang, Chengzhi

  • Author_Institution
    Inst. of Sci. & Tech. Inf. of China, Beijing, China
  • fYear
    2009
  • fDate
    7-9 Dec. 2009
  • Firstpage
    1084
  • Lastpage
    1088
  • Abstract
    Document clustering description is the problem of labeling the clusters produced by clustering a document collection. Such labels help users determine whether a cluster is relevant to their information needs. Labeling a clustered set of documents is therefore an important and challenging task in document clustering applications. The DCF (description comes first) method can generate document clustering descriptions. Because a DCF-based description is generated before the documents are clustered, there is a 'semantic interval' between the clustering description and the cluster central vector. This contradicts the intuition of 'first clustering, second description' and decreases the readability of the clustering description. This paper proposes a method based on a combination strategy, i.e. a combination of DCF and DCL (description comes last), to solve the problem of weak readability of clustering descriptions. Experimental results show that the method is effective, and the method is used to describe search result clustering.
  • Keywords
    document handling; pattern clustering; cluster central vector; combination strategy; description comes first method; description comes last method; document clustering description; document collection clustering; Clustering algorithms; Data mining; Frequency; Information management; Labeling
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Innovative Computing, Information and Control (ICICIC), 2009 Fourth International Conference on
  • Conference_Location
    Kaohsiung
  • Print_ISBN
    978-1-4244-5543-0
  • Type

    conf

  • DOI
    10.1109/ICICIC.2009.178
  • Filename
    5412632