Title :
An Improved Growing LVQ for Text Classification
Author :
Wang, Xiujun ; Shen, Hong
Author_Institution :
Dept. of Comput. Sci. & Technol., Univ. of Sci. & Technol. of China, Hefei, China
Abstract :
KNN as a simple classification method has been widely applied in text classification. There are two problems in KNN-based text classification: the large computation load and the deterioration of classification accuracy caused by the uneven distribution of training samples. To solve these problems, we propose a new growing LVQ method and apply it to text classification based on minimizing the increment of learning errors. Our method can generate a representative sample (reference sample) set after one phase of training of sample set, and hence has a strong learning ability. The experiment shows the improvement on both time and accuracy. For our algorithm, we also proposed a learning sequence arrangement method which performs better than others.
Keywords :
learning (artificial intelligence); pattern classification; text analysis; KNN-based text classification; improved growing LVQ; learning errors; learning sequence arrangement; learning vector quantification; Classification tree analysis; Computer science; Distributed computing; Electronic mail; Fuzzy systems; Machine learning; Machine learning algorithms; Support vector machines; Text categorization; Wavelet domain;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on
Conference_Location :
Tianjin
Print_ISBN :
978-0-7695-3735-1
DOI :
10.1109/FSKD.2009.340