DocumentCode
2386618
Title
Co_NBM: A Semi-Supervised Categorization Algorithm Based TEF_WA Technique
Author
Tang Huanling ; Lu Mingyu ; Liu Na
Author_Institution
Dalian Maritime Univ., Dalian
fYear
2007
fDate
2-4 Nov. 2007
Firstpage
271
Lastpage
271
Abstract
We propose a semi-supervised categorization algorithm Co_NBM incorporating co-training and TEF_WA technique. General co-training algorithm relies on the assumption that the features set can be split into two compatible and independent views. However, the assumption is usually violated to some degree in practice and sometimes the natural feature split does not exist. TEF_WA technique utilizes term evaluation functions to reduce dimensionality and adjust terms weight. Now, it is used to construct multiple views. Our experimental results show that utilizing unlabeled data Co_NBM can significantly decrease classification error, especially when labeled training data are sparse.
Keywords
Bayes methods; document handling; learning (artificial intelligence); pattern classification; Co_NBM co-training algorithm; TEF_WA technique; document classification; naive Bayes classifier; semi-supervised document categorization algorithm; semi-supervised learning; term evaluation function-weight adjustment; Assembly; Constraint theory; Semisupervised learning; Smoothing methods; Text categorization; Training data; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Granular Computing, 2007. GRC 2007. IEEE International Conference on
Conference_Location
Fremont, CA
Print_ISBN
978-0-7695-3032-1
Type
conf
DOI
10.1109/GrC.2007.14
Filename
4403108
Link To Document