DocumentCode :
2386618
Title :
Co_NBM: A Semi-Supervised Categorization Algorithm Based TEF_WA Technique
Author :
Tang Huanling ; Lu Mingyu ; Liu Na
Author_Institution :
Dalian Maritime Univ., Dalian
fYear :
2007
fDate :
2-4 Nov. 2007
Firstpage :
271
Lastpage :
271
Abstract :
We propose a semi-supervised categorization algorithm Co_NBM incorporating co-training and TEF_WA technique. General co-training algorithm relies on the assumption that the features set can be split into two compatible and independent views. However, the assumption is usually violated to some degree in practice and sometimes the natural feature split does not exist. TEF_WA technique utilizes term evaluation functions to reduce dimensionality and adjust terms weight. Now, it is used to construct multiple views. Our experimental results show that utilizing unlabeled data Co_NBM can significantly decrease classification error, especially when labeled training data are sparse.
Keywords :
Bayes methods; document handling; learning (artificial intelligence); pattern classification; Co_NBM co-training algorithm; TEF_WA technique; document classification; naive Bayes classifier; semi-supervised document categorization algorithm; semi-supervised learning; term evaluation function-weight adjustment; Assembly; Constraint theory; Semisupervised learning; Smoothing methods; Text categorization; Training data; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Granular Computing, 2007. GRC 2007. IEEE International Conference on
Conference_Location :
Fremont, CA
Print_ISBN :
978-0-7695-3032-1
Type :
conf
DOI :
10.1109/GrC.2007.14
Filename :
4403108
Link To Document :
بازگشت