• DocumentCode
    2942505
  • Title

    Improving the Performance of Text Categorization Using Automatic Summarization

  • Author

    Jiang Xiao-Yu ; Fan Xiao-Zhong ; Wang Zhi-Fei ; Jia Ke-Liang

  • Author_Institution
    Bus. Sch., Beijing Inst. of Fashion Technol., Beijing
  • fYear
    2009
  • fDate
    20-22 Feb. 2009
  • Firstpage
    347
  • Lastpage
    351
  • Abstract
    In order to reduce the dimensionality of feature vector space and reduce the computing complexity of categorization, each document of the train set is summarized automatically and two approaches to text categorization based on these summaries are proposed: in the first approach, the text summarization is directly used for feature selection and categorization instead of the original text; in the second approach, each summary is used to select and weight features for each document, and free texts are classified using KNN algorithm. Experimental results show that the two proposed methods using automatic summarization can not only reduce the time of classifier training, but also improve the performance of text categorization.
  • Keywords
    computational complexity; text analysis; KNN algorithm; automatic summarization; computing complexity; feature selection; feature vector space; text categorization; text summarization; Computational modeling; Computer science; Computer simulation; Equations; Noise reduction; Space technology; Testing; Text categorization; automatic summarization; feature selection; text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Modeling and Simulation, 2009. ICCMS '09. International Conference on
  • Conference_Location
    Macau
  • Print_ISBN
    978-0-7695-3562-3
  • Electronic_ISBN
    978-1-4244-3561-6
  • Type

    conf

  • DOI
    10.1109/ICCMS.2009.29
  • Filename
    4797414