Title :
An enhanced EM method of semi-supervised classification based on Naive Bayesian
Author :
Wen Han ; Xiao Nan-Feng ; Li Zhao
Author_Institution :
Sch. of Comput. Sci. & Eng., Univ. of Technol., Guangzhou, China
Abstract :
Semi-supervised learning (SSL) based on Naïve Bayesian and Expectation Maximization (EM) combines a small number of labeled documents with a large amount of unlabeled data to help train a classifier and increase classification accuracy. To improve the efficiency of the basic EM algorithm, an enhanced EM method is proposed. First, a feature selection function based on strong category information is constructed to control the dimension of the feature vector and preserve useful feature terms. Second, during each iteration of the EM algorithm, an intermediate classifier gradually transfers the unlabeled documents with maximum posterior category probability to the labeled collection. The enhanced EM requires markedly fewer iterations than the basic EM. Finally, experiments show that the improved method performs well in terms of macro-average accuracy and algorithm efficiency.
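The transfer step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a multinomial Naïve Bayes model with Laplace smoothing, takes the vocabulary as already selected (the paper's feature selection function is not specified here), and uses a hypothetical confidence `threshold` to decide which unlabeled documents are moved to the labeled collection. The names `train_nb`, `posterior`, and `enhanced_em` are illustrative only.

```python
import math
from collections import Counter


def train_nb(docs, labels, vocab):
    """Fit multinomial Naive Bayes with Laplace smoothing (log-space)."""
    classes = sorted(set(labels))
    doc_count = Counter(labels)
    word_count = {c: Counter() for c in classes}
    for tokens, c in zip(docs, labels):
        word_count[c].update(t for t in tokens if t in vocab)
    log_prior = {c: math.log(doc_count[c] / len(docs)) for c in classes}
    log_like = {}
    for c in classes:
        total = sum(word_count[c].values()) + len(vocab)
        log_like[c] = {w: math.log((word_count[c][w] + 1) / total) for w in vocab}
    return log_prior, log_like


def posterior(tokens, model):
    """Return P(class | tokens) for every class; out-of-vocabulary tokens are ignored."""
    log_prior, log_like = model
    scores = {
        c: log_prior[c] + sum(log_like[c][t] for t in tokens if t in log_like[c])
        for c in log_prior
    }
    top = max(scores.values())  # subtract the max for numerical stability
    norm = sum(math.exp(s - top) for s in scores.values())
    return {c: math.exp(s - top) / norm for c, s in scores.items()}


def enhanced_em(labeled, unlabeled, vocab, threshold=0.9, max_iter=20):
    """Iteratively move confidently classified unlabeled docs into the labeled set.

    `threshold` is an assumption of this sketch, not a value from the paper.
    Returns the final model and the documents that were never transferred.
    """
    docs = [d for d, _ in labeled]
    labels = [c for _, c in labeled]
    pool = list(unlabeled)
    model = train_nb(docs, labels, vocab)
    for _ in range(max_iter):
        if not pool:
            break
        remaining = []
        for d in pool:
            post = posterior(d, model)
            best_class, best_prob = max(post.items(), key=lambda kv: kv[1])
            if best_prob >= threshold:
                docs.append(d)          # transfer to the labeled collection
                labels.append(best_class)
            else:
                remaining.append(d)
        if len(remaining) == len(pool):
            break                       # nothing was transferred; stop early
        pool = remaining
        model = train_nb(docs, labels, vocab)  # retrain the intermediate classifier
    return model, pool
```

Because the unlabeled pool shrinks whenever documents are transferred, the loop can stop as soon as the pool is empty or no document clears the threshold, which is one plausible reading of why fewer iterations are needed than in basic EM.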
Keywords :
Bayes methods; expectation-maximisation algorithm; learning (artificial intelligence); text analysis; algorithm efficiency; automatic text classification; category information; expectation maximization method enhancement; feature selection function; feature terms; feature vector; intermediate classifier; iteration process; macro average accuracy; maximum posterior category probability; naive Bayesian; semisupervised learning; Accuracy; Bayesian methods; Classification algorithms; Educational institutions; Machine learning; Mathematical model; Text categorization; Naïve Bayesian; Semi-supervised classification; enhanced EM; feature selection
Conference_Title :
Fuzzy Systems and Knowledge Discovery (FSKD), 2011 Eighth International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-61284-180-9
DOI :
10.1109/FSKD.2011.6019690