DocumentCode
2057332
Title
A Semi-supervised Text Classification Method Based on Incremental EM Algorithm
Author
Fan, Xinghua ; Guo, Zhiyi
Author_Institution
Coll. of Comput. Sci. & Technol., Chongqing Univ. of Posts & Telecommun., Chongqing, China
Volume
2
fYear
2010
fDate
14-15 Aug. 2010
Firstpage
211
Lastpage
214
Abstract
In the standard EM-based semi-supervised text classification, the classification performance is not well when the initial labeled samples are a few. How to improve the performance is an important issue. In view of this, a semi-supervised method based on incremental EM algorithm is proposed. This method makes full use of the useful information of intermediate classifier. On the one hand, this method verifies the feasibility of division existed in unlabeled samples, and uses the division mechanism to enhance the reliability of new incremental samples by dividing the unlabeled samples scientifically; on the other hand, a feedback learning mechanism is proposed, and it is used to decrease the probability of adding misclassified samples. Experimental results show that the classification performance is improved in our method.
Keywords
iterative methods; learning (artificial intelligence); pattern classification; text analysis; feedback learning mechanism; incremental EM algorithm; intermediate classifier; iterative algorithms; semisupervised text classification method; Classification algorithms; Learning systems; Medical services; Niobium; Reliability; Text categorization; Training; EM algorithm; division mechanism; feedback learning; semi-supervised learning; text classification;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Engineering (ICIE), 2010 WASE International Conference on
Conference_Location
Beidaihe, Hebei
Print_ISBN
978-1-4244-7506-3
Electronic_ISBN
978-1-4244-7507-0
Type
conf
DOI
10.1109/ICIE.2010.146
Filename
5571349
Link To Document