• DocumentCode
    3215063
  • Title

    Classification Model Learning for Bulletin Board Site Analysis Based on Unbalanced Textual Examples

  • Author

    Sakurai, Shigeki ; Orihara, Ryohei

  • Author_Institution
    Toshiba Corp., Kawasaki
  • fYear
    2008
  • fDate
    25-28 March 2008
  • Firstpage
    494
  • Lastpage
    501
  • Abstract
    This paper proposes a method that acquires a more appropriate classification model for label extraction. The model can extract specific labels from articles included in bulletin board sites. The labels represent the contents of the articles and are used to characterize the articles. The method selects two kinds of important examples not including a specific label by using expressions related to the label. The method inductively acquires the classification model from the selected examples and examples including the label. The paper applies the method to articles collected from three bulletin board sites and verifies its effect through comparative experiments.
  • Keywords
    feature extraction; image classification; image texture; bulletin board site analysis; classification model; label extraction; unbalanced textual examples; bulletin board site; imbalance problem; text classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Information Networking and Applications, 2008. AINA 2008. 22nd International Conference on
  • Conference_Location
    Okinawa
  • ISSN
    1550-445X
  • Print_ISBN
    978-0-7695-3095-6
  • Type

    conf

  • DOI
    10.1109/AINA.2008.57
  • Filename
    4482747