• DocumentCode
    2396982
  • Title

    Text clustering ensemble based on genetic algorithms

  • Author

    Mao-ting Gao ; Bing-jing Wang

  • Author_Institution
    Coll. of Inf. Eng., Shanghai Maritime Univ., Shanghai, China
  • fYear
    2012
  • fDate
    19-20 May 2012
  • Firstpage
    2329
  • Lastpage
    2332
  • Abstract
    Text feature is usually expressed as a matrix of huge dimensionality in text mining, and common clustering algorithm are not stable and cannot obtain clustering solution efficiently. Latent Semantic Analysis can reduce dimensionality effectively, and emerges the semantic relations between texts and terms. Clustering ensemble can get better clustering solution than single clustering method. A text clustering ensemble based on genetic algorithms is presented, which combines Latent Semantic Analysis and Clustering ensemble based on genetic algorithms. Experiments have demonstrated that text clustering ensemble based on genetic algorithms can effectively improve the clustering performance.
  • Keywords
    data mining; genetic algorithms; natural language processing; pattern clustering; text analysis; dimensionality reduction; genetic algorithms; huge dimensionality; latent semantic analysis; text clustering ensemble; text feature; text mining; Accuracy; Algorithm design and analysis; Biological cells; Clustering algorithms; Genetic algorithms; Matrix decomposition; Semantics; clustering ensemble; genetic algorithm; latent semantic analysis; text clustering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems and Informatics (ICSAI), 2012 International Conference on
  • Conference_Location
    Yantai
  • Print_ISBN
    978-1-4673-0198-5
  • Type

    conf

  • DOI
    10.1109/ICSAI.2012.6223521
  • Filename
    6223521