DocumentCode
2396982
Title
Text clustering ensemble based on genetic algorithms
Author
Mao-ting Gao ; Bing-jing Wang
Author_Institution
Coll. of Inf. Eng., Shanghai Maritime Univ., Shanghai, China
fYear
2012
fDate
19-20 May 2012
Firstpage
2329
Lastpage
2332
Abstract
Text feature is usually expressed as a matrix of huge dimensionality in text mining, and common clustering algorithm are not stable and cannot obtain clustering solution efficiently. Latent Semantic Analysis can reduce dimensionality effectively, and emerges the semantic relations between texts and terms. Clustering ensemble can get better clustering solution than single clustering method. A text clustering ensemble based on genetic algorithms is presented, which combines Latent Semantic Analysis and Clustering ensemble based on genetic algorithms. Experiments have demonstrated that text clustering ensemble based on genetic algorithms can effectively improve the clustering performance.
Keywords
data mining; genetic algorithms; natural language processing; pattern clustering; text analysis; dimensionality reduction; genetic algorithms; huge dimensionality; latent semantic analysis; text clustering ensemble; text feature; text mining; Accuracy; Algorithm design and analysis; Biological cells; Clustering algorithms; Genetic algorithms; Matrix decomposition; Semantics; clustering ensemble; genetic algorithm; latent semantic analysis; text clustering;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems and Informatics (ICSAI), 2012 International Conference on
Conference_Location
Yantai
Print_ISBN
978-1-4673-0198-5
Type
conf
DOI
10.1109/ICSAI.2012.6223521
Filename
6223521
Link To Document