مرکز منطقه ای اطلاع رساني علوم و فناوري - A New Density-Based Method for Reducing the Amount of Training Data in k-NN Text Classification

DocumentCode :

1936379

Title :

A New Density-Based Method for Reducing the Amount of Training Data in k-NN Text Classification

Author :

Yuan, Fang ; Yang, Liu ; Yu, Ge

Author_Institution :

Hebei Univ., Baoding

Volume :

fYear :

2007

fDate :

19-22 Aug. 2007

Firstpage :

3372

Lastpage :

3376

Abstract :

With the rapid development of WWW, text classification has become the key technology in organizing and processing large amount of text data. As a simple, effective and nonparametric classification method, k-NN method is widely used in text classification. But k-NN clasifier not only has large computational demands, but also may decrease the precision of classification because of uneven density of training data. In this paper, a new density-based method for reducing the amount of training data is presented, which not only reduces the computational demands of k-NN classifier, but also improves the classification precision. The experiments show that the new method has better performance than the traditional k-NN method.

Keywords :

classification; text analysis; density-based method; k-nearest neighbor classifier; text classification; Computer science; Cybernetics; Educational institutions; Internet; Machine learning; Mathematics; Nearest neighbor searches; Testing; Text categorization; Training data; Density; K-Nearest Neighbor; Text classification; Training data;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Machine Learning and Cybernetics, 2007 International Conference on

Conference_Location :

Hong Kong

Print_ISBN :

978-1-4244-0973-0

Electronic_ISBN :

978-1-4244-0973-0

Type :

conf

DOI :

10.1109/ICMLC.2007.4370730

Filename :

4370730

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1936379