DocumentCode :
3215063
Title :
Classification Model Learning for Bulletin Board Site Analysis Based on Unbalanced Textual Examples
Author :
Sakurai, Shigeki ; Orihara, Ryohei
Author_Institution :
Toshiba Corp., Kawasaki
fYear :
2008
fDate :
25-28 March 2008
Firstpage :
494
Lastpage :
501
Abstract :
This paper proposes a method that acquires a more appropriate classification model for label extraction. The model can extract specific labels from articles included in bulletin board sites. The labels represent the contents of the articles and are used to characterize the articles. The method selects two kinds of important examples not including a specific label by using expressions related to the label. The method inductively acquires the classification model from the selected examples and examples including the label. The paper applies the method to articles collected from three bulletin board sites and verifies its effect through comparative experiments.
Keywords :
feature extraction; image classification; image texture; bulletin board site analysis; classification model; label extraction; unbalanced textual examples; bulletin board site; imbalance problem; text classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Information Networking and Applications, 2008. AINA 2008. 22nd International Conference on
Conference_Location :
Okinawa
ISSN :
1550-445X
Print_ISBN :
978-0-7695-3095-6
Type :
conf
DOI :
10.1109/AINA.2008.57
Filename :
4482747
Link To Document :
بازگشت