• DocumentCode
    569307
  • Title

    Exploring Neighborhood Influence in Text Classification

  • Author

    Le, Nam Do-Hoang ; Tran, Thai-Son ; Tran, Minh-Triet

  • Author_Institution
    Fac. of Inf. Technol., Univ. of Sci., Ho Chi Minh City, Vietnam
  • fYear
    2012
  • fDate
    17-19 Aug. 2012
  • Firstpage
    79
  • Lastpage
    85
  • Abstract
    Standard supervised learning approaches have been widely applied on the text classification problem. These standard approaches exploit only the local content of the document. However, the additional information in the relationship between the items can be used to improve the overall accuracy of the classification process. To make use of this information, the authors propose a statistical model to capture both the contents and labels from each link the neighborhood. This link model is then incorporated with the Markov Random Field model to form the soft labeling model for text classification. This new approach has combined both the local content and the influence from the neighborhood. The results of soft labeling model on standard data sets are also promising. Moreover, the new model can be applied on not only the text classification problem but also many kinds of richly structured data sets.
  • Keywords
    Markov processes; learning (artificial intelligence); pattern classification; random processes; set theory; statistical analysis; text analysis; Markov random field model; accuracy improvement; document local contents; link model; soft-labeling model; standard data sets; standard supervised learning approaches; statistical model; text classification problem; Accuracy; Computational modeling; Correlation; Labeling; Logistics; Support vector machines; Text categorization; bibliological networks; document categorization; graphical model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Knowledge and Systems Engineering (KSE), 2012 Fourth International Conference on
  • Conference_Location
    Danang
  • Print_ISBN
    978-1-4673-2171-6
  • Type

    conf

  • DOI
    10.1109/KSE.2012.35
  • Filename
    6299402