Title :
Evaluation of Text Clustering Based on Iterative Classification
Author :
Wang Xiaohua ; Lou Jia
Author_Institution :
Inst. of Comput. Applic. Technol., Hang Zhou Dian Zi Univ., Hangzhou, China
Abstract :
Text clustering is a useful and inexpensive way to organize vast text repositories into meaningful topics categories. Although text clustering can be seen as an alternative to supervised text categorization, the question remains of how to determine if the resulting clusters are of sufficient quality in a real-life application. However, it is difficult to evaluate a given clustering of documents. Furthermore, the existing quality measures rely on the labor standard, which is difficult and time-consuming. The need for fair methods that can assess the validation of clustering results is becoming more and more critical. In this paper, we propose and experiment an innovative evaluation measure that allows one to effectively and correctly assess the clustering results.
Keywords :
iterative methods; pattern clustering; text analysis; document clustering; iterative classification; supervised text categorization; text clustering; text repository; topics category; Clustering algorithms; Computer applications; Internet; Iterative methods; Management information systems; Measurement standards; Navigation; Software libraries; Testing; Text categorization;
Conference_Titel :
Computational Intelligence and Software Engineering, 2009. CiSE 2009. International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-4507-3
Electronic_ISBN :
978-1-4244-4507-3
DOI :
10.1109/CISE.2009.5364099