• Title of article

    Automatic text categorization based on content analysis with cognitive situation models

  • Author/Authors

    Yi Guo، نويسنده , , Zhiqing Shao، نويسنده , , Nan Hua، نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 2010
  • Pages
    18
  • From page
    613
  • To page
    630
  • Abstract
    Text categorization is an important research area of text mining. The original purpose of text categorization is to recognize, understand and organize different types of texts or documents. The general categorization approaches are treated as supervised learning, which infers similarity among a collection of categorized texts for training purposes. The existing categorization approaches are obviously not content-oriented and constrained at single word level. This paper introduces an innovative content-oriented text categorization approach named as CogCate. Inspired by cognitive situation models, CogCate exploits a human cognitive procedure in categorizing texts. In addition to traditional statistical analysis at word level, CogCate also applies lexical/semantical analysis, which ensures the accuracy of categorization. The evaluation experiments have testified the performance of CogCate. Meanwhile, CogCate remarkably reduces the time and effort spent on software training and maintenance of text collections. Our research work attests that interdisciplinary research efforts benefit text categorization.
  • Keywords
    Lexical/semantical analysis , Text Categorization , Cognitive situation models , Content analysis
  • Journal title
    Information Sciences
  • Serial Year
    2010
  • Journal title
    Information Sciences
  • Record number

    1213857