DocumentCode :
2510016
Title :
A semi-supervised k-nearest neighbor algorithm based on data editing
Author :
Yongfang, Xie ; Youwei, Jiang ; Mingzhu, Tang
Author_Institution :
Inst. of Inf. Sci. & Eng., Central South Univ., Changsha, China
fYear :
2011
fDate :
23-25 May 2011
Firstpage :
41
Lastpage :
45
Abstract :
YATSI may suffer more from the common problem in semi-supervised learning, i.e. the performance is badly influenced due to the unlabeled examples may often be wrongly labeled. In this paper a semi-supervised k-nearest neighbor classifier named De-YATSI (YATSI with Data Editing) is proposed. A data editing based on estimating class conditional probability is used to identify and discard mislabeled examples of the pre-labeled data set. A k-nearest neighbor classifier with weights is trained by the labeled data set and the edited “pre-labeled” data set. Experiments on UCI datasets show that DE-YATSI could more effectively and stably utilize the unlabeled examples to improve classification accuracy than YATSI.
Keywords :
learning (artificial intelligence); text editing; De-YATSI; UCI datasets; class conditional probability estimation; data editing; pre-labeled data set; semisupervised k-nearest neighbor algorithm; semisupervised learning; Cybernetics; Information science; Iris; Iris recognition; Pattern recognition; Presses; Programmable logic arrays; data editing; k-nearest neighbor; semi-supervised learning; weka;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Control and Decision Conference (CCDC), 2011 Chinese
Conference_Location :
Mianyang
Print_ISBN :
978-1-4244-8737-0
Type :
conf
DOI :
10.1109/CCDC.2011.5968142
Filename :
5968142
Link To Document :
بازگشت