Title :
Word Categorization Using Clustering Ensemble
Author :
Abdoos, Monireh ; Naeini, Seyed Gholamreza Jalali
Author_Institution :
Iran Univ. of Sci. & Technol., Tehran, Iran
Abstract :
Entropy model is the base structure of automated word categorizing. In this model, words appear consecutive frequently will place in different groups. Although this method is not correct always, in the most cases, obtained results simulate real situation. Because of NP-complete structure of clustering problems, the entropy model cannot be solved by an optimal algorithm, so a number of heuristic algorithms were developed to solve this problem. Some examples of these heuristics are artificial neural networks, genetic algorithms, greedy algorithms and so on. In this paper, we used a clustering ensemble method for word categorization. The method uses a new feature space generated by k-means for clustering. The results show that an improvement in categorization is obtained.
Keywords :
classification; entropy; pattern clustering; text analysis; clustering ensemble; entropy model; k-means clustering; word categorization; Artificial neural networks; Clustering algorithms; Computer networks; Entropy; Frequency; Genetic algorithms; Heuristic algorithms; Natural languages; Neural networks; Statistics; Clustering Ensemble; Entropy; Mutual Information; Word Categorization;
Conference_Titel :
Advanced Computer Theory and Engineering, 2008. ICACTE '08. International Conference on
Conference_Location :
Phuket
Print_ISBN :
978-0-7695-3489-3
DOI :
10.1109/ICACTE.2008.166