DocumentCode
3215063
Title
Classification Model Learning for Bulletin Board Site Analysis Based on Unbalanced Textual Examples
Author
Sakurai, Shigeki ; Orihara, Ryohei
Author_Institution
Toshiba Corp., Kawasaki
fYear
2008
fDate
25-28 March 2008
Firstpage
494
Lastpage
501
Abstract
This paper proposes a method that acquires a more appropriate classification model for label extraction. The model can extract specific labels from articles included in bulletin board sites. The labels represent the contents of the articles and are used to characterize the articles. The method selects two kinds of important examples not including a specific label by using expressions related to the label. The method inductively acquires the classification model from the selected examples and examples including the label. The paper applies the method to articles collected from three bulletin board sites and verifies its effect through comparative experiments.
Keywords
feature extraction; image classification; image texture; bulletin board site analysis; classification model; label extraction; unbalanced textual examples; bulletin board site; imbalance problem; text classification;
fLanguage
English
Publisher
ieee
Conference_Titel
Advanced Information Networking and Applications, 2008. AINA 2008. 22nd International Conference on
Conference_Location
Okinawa
ISSN
1550-445X
Print_ISBN
978-0-7695-3095-6
Type
conf
DOI
10.1109/AINA.2008.57
Filename
4482747
Link To Document