• DocumentCode
    1639142
  • Title

    Dynamic and adaptive self organizing maps applied to high dimensional large scale text clustering

  • Author

    Feng, Zhonghui ; Bao, Junpeng ; Shen, Junyi

  • Author_Institution
    Inst. of Comput. Software, Xi´´an Jiaotong Univ., Xi´´an, China
  • fYear
    2010
  • Firstpage
    348
  • Lastpage
    351
  • Abstract
    The self organizing maps(SOM) has been used as a tool for mapping high-dimensional input data into a low-dimensional feature map, which has significant advantages for text clustering applications. In this paper, a novel dynamic and adaptive SOM algorithm applied to high dimensional large scale text clustering is proposed. The characteristic feature of this novel neural network model is its dynamic architecture which grows (when the similarity between input pattern (text vector) and weight vector of the winning node is smaller than a given threshold) during its training process to find the inherent topology structure of the document set. By using unsupervised competitive learning in network, the weight vectors of the winning node and its nearest neighbors are adjusted adaptively (where learning rate is related to similarity in amended learning rule) in this algorithm. The results of the experiments indicated that the algorithm successfully improve quality of text clustering and learning speed of neural network.
  • Keywords
    learning (artificial intelligence); pattern clustering; self-organising feature maps; text analysis; adaptive self organizing maps; large scale text clustering; neural network model; topology structure; unsupervised competitive learning; Clustering algorithms; Computer architecture; Heuristic algorithms; Optimization; Self organizing feature maps; Training; dynamic and adaptive self organizing maps; text clustering; text vector;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Engineering and Service Sciences (ICSESS), 2010 IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-6054-0
  • Type

    conf

  • DOI
    10.1109/ICSESS.2010.5552449
  • Filename
    5552449