• DocumentCode
    1698543
  • Title

    Improved SOM Algorithm-HDSOM Applied in Text Clustering

  • Author

    Sun Ai-xiang

  • Author_Institution
    Manage. Inst., Shandong Univ. of Technol., Zibo, China
  • fYear
    2010
  • Firstpage
    306
  • Lastpage
    309
  • Abstract
    SOM neural network is one of the most commonly used Clustering algorithm in the text clustering. The initial connection weights of SOM neural network will affect the degree of convergence. If the Initial connection weights are not set appropriate, that will cause in a long wandering around the local minimum, accordingly lower the speed of convergence, or even cause local convergence or not convergence. Initializing the connection weights closer to the center of each category can highten the speed of convergence.Because text-data-intensive area may contain category center or close to category center , this paper presents a hierarchical clustering method to detect text-data-intensive areas and use the center of the K detected text-data-intensive areas to initialize the connection weight of SOM neural network, in order to improve the speed of SOM neural network convergence. The experimental results showed that: ensuring the effectiveness of text clustering, the text clustering speed is greatly improved.
  • Keywords
    convergence; neural nets; pattern clustering; self-organising feature maps; text analysis; HDSOM; SOM algorithm; SOM neural network convergence; category center; clustering algorithm; hierarchical clustering method; initial connection weights; local convergence; text clustering; text-data-intensive area; Artificial neural networks; Clustering algorithms; Clustering methods; Convergence; Neurons; Text mining; Training; SOM neural network; convergence; text clustering; weight;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia Information Networking and Security (MINES), 2010 International Conference on
  • Conference_Location
    Nanjing, Jiangsu
  • Print_ISBN
    978-1-4244-8626-7
  • Electronic_ISBN
    978-0-7695-4258-4
  • Type

    conf

  • DOI
    10.1109/MINES.2010.74
  • Filename
    5670834