DocumentCode
2233865
Title
Online biomedical publication classification using Multi-Instance Multi-Label algorithms with feature reduction
Author
Ren, Dong ; Ma, Long ; Zhang, Yanqing ; Sunderraman, Raj ; Fox, Peter T. ; Laird, Angela R. ; Turner, Jessica A. ; Turner, Matthew D.
Author_Institution
Department of Computer Science, Georgia State University, Atlanta, USA
fYear
2015
fDate
6-8 July 2015
Firstpage
234
Lastpage
241
Abstract
Text annotation, the assignment of metadata to documents, requires significant time and effort when performed by humans. A variety of text mining methods have been used to automate this process, many of them based on either keyword extraction or word counts. However, when using keywords as text classification features, it is common to find that (1) the number of training instances is much less than the number of features extracted. This complexity affects text classification performance. Another challenge is (2) the assignment of multiple, non-exclusive labels to the documents (multi-label classification). This problem makes text classification more complicated when compared with single label classification. We use, as an example, a set of expertly labeled documents from the human functional neuroimaging literature, and we apply a Multi-instance Multi-label (MIML) classification algorithm to the problem. To address (1), we apply a feature reduction approach to reduce the feature dimension. For (2) we use an MIML algorithm called MIMLfast to implement the multi-label classification.
Keywords
Bagging; Classification algorithms; Gold; Metadata; Testing; Training; World Wide Web; feature reduction; multi-instance multi-label classification; multi-label classification; neuroinformatics; text annotation; text mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Cognitive Informatics & Cognitive Computing (ICCI*CC), 2015 IEEE 14th International Conference on
Conference_Location
Beijing, China
Print_ISBN
978-1-4673-7289-3
Type
conf
DOI
10.1109/ICCI-CC.2015.7259391
Filename
7259391
Link To Document