Title :
Text Classification by Combining Different Distance Functions withWeights
Author :
Yamada, Takahiro ; Yamashita, Kyohei ; Ishii, Naohiro ; Iwata, Kazunori
Author_Institution :
Aichi Inst. of Technol., Toyota
Abstract :
Since data is becoming greatly large in the networks, the machine classification of the text data, is not easy under these computing circumstances. Though the k-nearest neighbor (kNN) classification is a simple and effective classification approach, the improving performance of the classifier is still attractive to cope with the high accuracy processing. In this paper, the kNN is improved by applying the different distance functions with weights to measure data from the multi-view points. Then, the weights for the optimization are computed by the genetic algorithms. After the learning of the trained data, the unknown data is classified by combining the multiple distance functions and ensemble computations of the kNN. In this paper we present a new approach to combine multiple kNN classifiers based on different distance functions, which improve the performance of the k-nearest neighbor method. The proposed combining algorithm shows the higher generalization accuracy when compared to other conventional learning algorithms
Keywords :
genetic algorithms; learning (artificial intelligence); pattern classification; text analysis; distance functions; genetic algorithm; k-nearest neighbor classification; learning algorithm; machine classification; text classification; Computer networks; Data communication; Euclidean distance; Genetic algorithms; Information technology; Nearest neighbor searches; Text categorization; Training data; Voting; Weight measurement;
Conference_Titel :
Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, 2006. SNPD 2006. Seventh ACIS International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
0-7695-2611-X
DOI :
10.1109/SNPD-SAWN.2006.69