Title :
Document categorisation by genetic algorithms
Author :
Liu, Chih-Hung ; Lu, Cheng-Che ; Lee, Wei-Po
Author_Institution :
Dept. of Manage. Inf. Syst., Nat. Pingtung Univ. of Sci. & Technol., Taiwan
Abstract :
Today, it is easy to provide information to and retrieve information from the Internet. However, the problem of information overload has to be overcome. One of the main issues to be addressed for the information overload problem is document classification. We present an evolutionary approach to automatically categorize documents into appropriate categories. Our approach deals with different categories of documents separately: it evolves a numerical list that consists of the corresponding weights of the feature words for each class of documents. Experimental results show that our approach can easily evolve the classifiers of numerical lists, and that the evolved classifiers perform better than those constructed by the traditional k-nearest neighbors approach
Keywords :
Internet; classification; genetic algorithms; information retrieval; pattern classification; Internet; document categorisation; document classification; evolutionary approach; feature words; genetic algorithms; information overload; k-nearest neighbors approach; numerical list; Buildings; Business; Data mining; Genetic algorithms; IP networks; Information filtering; Information filters; Information retrieval; Internet; Management information systems;
Conference_Titel :
Systems, Man, and Cybernetics, 2000 IEEE International Conference on
Conference_Location :
Nashville, TN
Print_ISBN :
0-7803-6583-6
DOI :
10.1109/ICSMC.2000.886614