DocumentCode :
2688689
Title :
Implication intensity: Randomized F-measure for cluster evaluation
Author :
Li, Limin ; Wu, Junjie ; Zhu, Shiwei
Author_Institution :
Sch. of Econ. & Manage., Beihang Univ., Beijing, China
fYear :
2009
fDate :
8-10 June 2009
Firstpage :
510
Lastpage :
515
Abstract :
The ever-growing resources of information and services on World Wide Web provide a welcome boost for the researches in the information retrieval space. Text clustering groups a set of documents into subsets or clusters so that the vast retrieved documents can be browsed selectively and efficiently. Many cluster validation measures, such as the F-measure, are then introduced to evaluate the clustering qualities. In this paper, however, we demonstrate that this widely adopted F-measure suffers from the so-call increment effect which may mislead the comparison of clustering results with different cluster numbers. To meet this challenge, we propose a novel ldquoimplication intensityrdquo (IMI) measure based on the F-measure and a random clustering perspective. Experimental results on real-world data sets demonstrate that IMI shows merits on alleviating the increment effect introduced by the F-measure.
Keywords :
Internet; information resources; information retrieval; pattern clustering; text analysis; F-measure; World Wide Web; cluster evaluation; implication intensity; increment effect; information resources; information retrieval; text clustering; Clustering algorithms; Information retrieval; Navigation; Organizing; Search engines; Web and internet services; Web sites; F-measure; cluster evaluation; implication intensity; increment effect;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Service Systems and Service Management, 2009. ICSSSM '09. 6th International Conference on
Conference_Location :
Xiamen
Print_ISBN :
978-1-4244-3661-3
Electronic_ISBN :
978-1-4244-3662-0
Type :
conf
DOI :
10.1109/ICSSSM.2009.5174937
Filename :
5174937
Link To Document :
بازگشت