DocumentCode :
3384363
Title :
Examples initialization in Chinese text categorization
Author :
Shi Cheng ; Yuhui Shi ; Quande Qin
Author_Institution :
Dept. of Electr. Eng. & Electron., Univ. of Liverpool, Liverpool, UK
fYear :
2013
fDate :
23-25 March 2013
Firstpage :
967
Lastpage :
971
Abstract :
Generalization ability is a fundamental goal for a classifier in machine learning. In a nearest neighbor classifier, the categorization results are influenced by the initialized examples, so the ability to generalize beyond the examples in the training set is important. In this paper, we propose a particle swarm optimization with k-means clustering algorithm for initializing the nearest neighbor classifier's examples to improve categorization performance. The classifier uses an iterative strategy, and its examples are initialized from cluster centers and random examples. The new classifier is tested on a Chinese text corpus and compared against the nearest neighbor classifier with random initialization.
Keywords :
natural language processing; particle swarm optimisation; text analysis; Chinese text categorization; Chinese text corpus; categorization results; examples initialization; generalization ability; iterative strategy; k means clustering algorithm; machine learning; nearest neighbor classifier; particle swarm optimization; random initialization; training set; Clustering algorithms; Error analysis; Measurement; Optimization; Particle swarm optimization; Text categorization; Training;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Information Science and Technology (ICIST), 2013 International Conference on
Conference_Location :
Yangzhou
Print_ISBN :
978-1-4673-5137-9
Type :
conf
DOI :
10.1109/ICIST.2013.6747699
Filename :
6747699