Title :
A new evaluation measure for information retrieval systems
Author :
Martin Mehlitz;Christian Bauckhage;Jerome Kunegis;Sahin Albayrak
Author_Institution :
Technical University Berlin, DAI-Labor, 10587, Germany
Abstract :
Some of the established approaches to evaluating text clustering algorithms for information retrieval show theoretical flaws. In this paper, we analyze these flaws and introduce a new evaluation measure to overcome them. Based on a simple yet rigorous mathematical analysis of the effect of certain parameters in cluster based retrieval, we show that certain conclusions drawn in the recent literature must be taken with a grain of salt. Our new measure, in contrast, accounts for statistical biases that have to be expected according to our analysis. A series of experiments and a comparison with results reported recently underlines that this measure is a more suitable performance indicator that allows for more meaningful interpretations.
Keywords :
"Information retrieval","Clustering algorithms","Mathematical analysis","Laboratories","Computer science","Storage automation","Mathematical model","Information analysis","Search engines"
Conference_Titel :
Systems, Man and Cybernetics, 2007. ISIC. IEEE International Conference on
Print_ISBN :
978-1-4244-0990-7
DOI :
10.1109/ICSMC.2007.4413662