• DocumentCode
    3568283
  • Title

    Dimension reduction with coevolutionary genetic algorithm for text classification

  • Author

    Gasanova, Tatiana ; Sergienko, Roman ; Semenkin, Eugene ; Minker, Wolfgang

  • Author_Institution
    Institute of Communications Engineering, Ulm University, Albert Einstein-Allee 43, 89081, Germany
  • Volume
    1
  • fYear
    2014
  • Firstpage
    215
  • Lastpage
    222
  • Abstract
    Text classification of large-size corpora is time-consuming for implementation of classification algorithms. For this reason, it is important to reduce dimension of text classification problems. We propose a method for dimension reduction based on hierarchical agglomerative clustering of terms and cluster weight optimization using cooperative coevolutionary genetic algorithm. The method was applied on 5 different corpora using several classification methods with different text preprocessing. The method reduces dimension of text classification problem significantly. Classification efficiency increases or decreases non-significantly after clustering with optimization of cluster weights.
  • Keywords
    Clustering algorithms; Dictionaries; Games; Genetic algorithms; Optimization; Support vector machines; Vocabulary; Clustering; Coevolutionary Algorithm; Dimension Reduction; Text Classification; Text Preprocessing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Informatics in Control, Automation and Robotics (ICINCO), 2014 11th International Conference on
  • Type

    conf

  • Filename
    7049774