• DocumentCode
    3310958
  • Title

    Research of text clustering based on fuzzy granular computing

  • Author

    Xia, Zhang ; Yixin, Yin ; Mingzhu, Xu ; Hailong, Zhao

  • Author_Institution
    Comput. Center, Hebei Univ. of Econ. & Bus., Shijiazhuang, China
  • fYear
    2009
  • fDate
    8-11 Aug. 2009
  • Firstpage
    288
  • Lastpage
    291
  • Abstract
    The typical algorithm of text clustering is a ldquoHard Partitionrdquo one, Actually, Chinese text is better to treat with ldquoSoft Partitionrdquo for its diversity and largeness. The fuzzy-set theory supply a powerful analyzing tool to this ldquoSoft partitionrdquo. Traditional fuzzy text clustering methods mostly are getting the fuzzy equivalent matrix or fuzzy division by iterating the matrix of membership degree, huge storage space is necessary for that process. The text clustering based on fuzzy granular computing will work as: first provide a normalized distance function in the fuzzy granularity space of text set, then use the function to do a dynamic clustering work to text who has a less distance than granularity. Approved by the test, this method has such advantages on reducing the computing complexity and space complexity, suitable for the status that many samples need to be processed.
  • Keywords
    fuzzy set theory; iterative methods; matrix algebra; pattern clustering; text analysis; fuzzy equivalent matrix; fuzzy granular computing; fuzzy granularity space; fuzzy-set theory; hard partition; iterative method; normalized distance function; soft partition; text clustering; Clustering algorithms; Clustering methods; Fuzzy sets; Internet; Machine learning; Machine learning algorithms; Mathematics; Partitioning algorithms; Power supplies; Railway engineering; fuzzy; granular computing; normalized distance function; text cluster;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Information Technology, 2009. ICCSIT 2009. 2nd IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-4519-6
  • Electronic_ISBN
    978-1-4244-4520-2
  • Type

    conf

  • DOI
    10.1109/ICCSIT.2009.5234519
  • Filename
    5234519