DocumentCode :
2057332
Title :
A Semi-supervised Text Classification Method Based on Incremental EM Algorithm
Author :
Fan, Xinghua ; Guo, Zhiyi
Author_Institution :
Coll. of Comput. Sci. & Technol., Chongqing Univ. of Posts & Telecommun., Chongqing, China
Volume :
2
fYear :
2010
fDate :
14-15 Aug. 2010
Firstpage :
211
Lastpage :
214
Abstract :
In the standard EM-based semi-supervised text classification, the classification performance is not well when the initial labeled samples are a few. How to improve the performance is an important issue. In view of this, a semi-supervised method based on incremental EM algorithm is proposed. This method makes full use of the useful information of intermediate classifier. On the one hand, this method verifies the feasibility of division existed in unlabeled samples, and uses the division mechanism to enhance the reliability of new incremental samples by dividing the unlabeled samples scientifically; on the other hand, a feedback learning mechanism is proposed, and it is used to decrease the probability of adding misclassified samples. Experimental results show that the classification performance is improved in our method.
Keywords :
iterative methods; learning (artificial intelligence); pattern classification; text analysis; feedback learning mechanism; incremental EM algorithm; intermediate classifier; iterative algorithms; semisupervised text classification method; Classification algorithms; Learning systems; Medical services; Niobium; Reliability; Text categorization; Training; EM algorithm; division mechanism; feedback learning; semi-supervised learning; text classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Engineering (ICIE), 2010 WASE International Conference on
Conference_Location :
Beidaihe, Hebei
Print_ISBN :
978-1-4244-7506-3
Electronic_ISBN :
978-1-4244-7507-0
Type :
conf
DOI :
10.1109/ICIE.2010.146
Filename :
5571349
Link To Document :
بازگشت