DocumentCode :
185999
Title :
On semantic evaluation of text clustering algorithms
Author :
Sinh Hoa Nguyen ; Swieboda, Wojciech ; Hung Son Nguyen
Author_Institution :
Inst. of Math., Univ. of Warsaw, Warsaw, Poland
fYear :
2014
fDate :
22-24 Oct. 2014
Firstpage :
224
Lastpage :
229
Abstract :
In this paper, we investigate the problem of quality analysis of clustering results using semantic annotations given by experts. In previous work we proposed a novel approach to construction of evaluation measure, called SEE (Semantic Evaluation by Exploration), which is an extension of the existing methods such as Rand Index or Normalized Mutual Information. In this paper we present some further extensions as well as some theoretical properties of the of the proposed measure. We illustrate the proposed evaluation method on documents in INFONA document retrieval system. We compare different search result clustering algorithms using the proposed measure.
Keywords :
information retrieval; pattern clustering; text analysis; INFONA document retrieval system; SEE; quality analysis; semantic annotations; semantic evaluation by exploration; text clustering algorithms; Approximation algorithms; Clustering algorithms; Decision trees; Indexes; Mutual information; Semantics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Granular Computing (GrC), 2014 IEEE International Conference on
Conference_Location :
Noboribetsu
Type :
conf
DOI :
10.1109/GRC.2014.6982839
Filename :
6982839
Link To Document :
بازگشت