Title :
A new method of text categorization based on PA and Kohonen network
Author :
Xu, Jian-Suo ; Wang, Zheng-Ou
Author_Institution :
Inst. of Syst. Eng., Tianjin Univ., China
Abstract :
This paper presents a new method of text categorization based on the theory of pattern aggregation (PA) and the Kohonen network. When applied to text categorization, the Kohonen network suffers from slow training, so we train the network with a supervised method, which improves both the speed and the precision of classification. However, for high-dimensional text vectors, classification with the Kohonen network is still very slow, and in some cases no categorization result can be obtained at all. The new method builds a term-weight vector space model using PA theory, which strengthens the contribution of words according to their effect on categorization and reduces the vector dimension by eliminating redundant features. The new method therefore substantially improves both the speed and the precision of text categorization.
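The pipeline outlined in the abstract (reduce the term-weight vectors, then train a Kohonen-style network with class labels) can be illustrated roughly as follows. This is a minimal sketch under stated assumptions, not the authors' algorithm: the abstract does not give the PA weighting formula or the exact supervised training rule, so the feature score (spread of class means) and the LVQ1-style update with one prototype per class and no neighbourhood function are stand-ins, and all function names and the toy data are hypothetical.

```python
import numpy as np

def select_features(X, y, keep):
    """Keep the `keep` term columns whose class means differ most.
    Assumption: a simple stand-in for PA-based redundancy elimination."""
    classes = np.unique(y)
    means = np.array([X[y == c].mean(axis=0) for c in classes])
    score = means.max(axis=0) - means.min(axis=0)
    top = np.argsort(score)[::-1][:keep]
    return np.sort(top)

def train_supervised(X, y, epochs=50, lr=0.3, seed=0):
    """LVQ1-style supervised competitive learning (assumed variant):
    one prototype per class, initialised at the class mean."""
    classes = np.unique(y)
    W = np.array([X[y == c].mean(axis=0) for c in classes], dtype=float)
    rng = np.random.default_rng(seed)
    for epoch in range(epochs):
        rate = lr * (1 - epoch / epochs)  # linearly decaying learning rate
        for i in rng.permutation(len(X)):
            winner = np.argmin(np.linalg.norm(W - X[i], axis=1))
            step = rate * (X[i] - W[winner])
            # Pull the winner toward the sample if its label matches,
            # push it away otherwise.
            W[winner] += step if classes[winner] == y[i] else -step
    return classes, W

def classify(x, classes, W):
    """Assign the label of the nearest prototype."""
    return classes[np.argmin(np.linalg.norm(W - x, axis=1))]

# Toy term-weight matrix: 6 documents x 5 terms, two categories.
# Terms 2 and 4 are constant and term 3 is shared, so they carry
# no category information.
X = np.array([[3, 0, 1, 0, 1],
              [2, 0, 1, 1, 1],
              [3, 1, 1, 0, 1],
              [0, 3, 1, 0, 1],
              [0, 2, 1, 1, 1],
              [1, 3, 1, 0, 1]], dtype=float)
y = np.array([0, 0, 0, 1, 1, 1])

cols = select_features(X, y, keep=2)
classes, W = train_supervised(X[:, cols], y)
preds = [int(classify(x, classes, W)) for x in X[:, cols]]
```

In this toy setting the uninformative term columns are dropped before training, so each distance computation runs over 2 dimensions instead of 5; the same idea is what makes dimension reduction matter for high-dimensional text vectors.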
Keywords :
learning (artificial intelligence); pattern classification; self-organising feature maps; text analysis; Kohonen network; pattern aggregation theory; pattern classification; supervising method; text categorization; training network; training speed; vector dimension reduction; vector space model; Cybernetics; Entropy; Feature extraction; Frequency; Learning systems; Machine learning; Mutual information; Pattern recognition; Text categorization; Tree data structures;
Conference_Title :
Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
Print_ISBN :
0-7803-8403-2
DOI :
10.1109/ICMLC.2004.1381978