Title :
Semi-supervised biomedical relation classification using generalized expectation criteria
Author :
Sun, Cheng-jie ; Lin Yao ; Lin, Lei ; Sha, Xue-Jun ; Wang, Xiao-long
Author_Institution :
Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin, China
Abstract :
Semi-supervised learning is necessary for the extensive application of machine learning in practice. It can use unlabeled data to improve the performance of existing supervised machine learning method. In this work, we addressed the biomedical relation classification problem by utilizing a semi-supervised method which can train Maximum Entropy models according to Generalized Expectation criteria. In the proposed method, instead of "instance labeling" used in previous works, the "feature labeling" was applied to get the training data which can save lots of labeling time. A topic model was involved to choose the features for labeling. Experiment results show that the proposed method can dramatically improve the performance of biomedical relation classification through incorporating unlabeled data by feature labeling.
Keywords :
data mining; entropy; learning (artificial intelligence); medical computing; pattern classification; text analysis; biomedical text mining; generalized expectation criteria; instance labeling; machine learning; maximum entropy models; semi-supervised biomedical relation classification; semi-supervised learning; supervised machine learning method; Biological system modeling; Data mining; Feature extraction; Foot; Hafnium compounds; Labeling; Proteins; biomedical relation classification; generalized expectation criteria; maximum entropy; semi-supervised learning;
Conference_Titel :
Machine Learning and Cybernetics (ICMLC), 2011 International Conference on
Conference_Location :
Guilin
Print_ISBN :
978-1-4577-0305-8
DOI :
10.1109/ICMLC.2011.6016953