Title :
Local Feature Selection for Generation of Ensembles in Text Clustering
Author :
Ribeiro, Marcelo Nunes ; Encio, Ricardo Bastos Cavalcante Prud
Author_Institution :
Centro de Inf., Univ. Fed. de Pernambuco, Recife, Brazil
Abstract :
In the context of text clustering, global feature selection tries to identify a single subset of features which are relevant to all clusters. However, the clustering process might be improved by considering different subsets of features for locally describing each cluster. In experiments with local feature selection, it was observed that the resulting partitions were unstable but there were cohesive groups that did not occur in all executions. Based on this result, local feature selection was proposed to generate partitions to be used in ensemble clustering. New experiments were performed to evaluate the generated ensembles and a gain in precision was observed.
Keywords :
pattern clustering; text analysis; ensembles; local feature selection; text clustering; Clustering algorithms; Context; Diversity reception; Feature extraction; Indexing; Partitioning algorithms; Text categorization; cluster ensembles; feature selection; text clustering;
Conference_Titel :
Neural Networks (SBRN), 2010 Eleventh Brazilian Symposium on
Conference_Location :
Sao Paulo
Print_ISBN :
978-1-4244-8391-4
Electronic_ISBN :
1522-4899
DOI :
10.1109/SBRN.2010.20