Title :
Classification Model Learning for Bulletin Board Site Analysis Based on Unbalanced Textual Examples
Author :
Sakurai, Shigeki ; Orihara, Ryohei
Author_Institution :
Toshiba Corp., Kawasaki
Abstract :
This paper proposes a method that acquires a more appropriate classification model for label extraction. The model can extract specific labels from articles included in bulletin board sites. The labels represent the contents of the articles and are used to characterize the articles. The method selects two kinds of important examples not including a specific label by using expressions related to the label. The method inductively acquires the classification model from the selected examples and examples including the label. The paper applies the method to articles collected from three bulletin board sites and verifies its effect through comparative experiments.
Keywords :
feature extraction; image classification; image texture; bulletin board site analysis; classification model; label extraction; unbalanced textual examples; bulletin board site; imbalance problem; text classification;
Conference_Titel :
Advanced Information Networking and Applications, 2008. AINA 2008. 22nd International Conference on
Conference_Location :
Okinawa
Print_ISBN :
978-0-7695-3095-6
DOI :
10.1109/AINA.2008.57