DocumentCode :
2704728
Title :
Principles for mining summaries using objective measures of interestingness
Author :
Hilderman, Robert J. ; Hamilton, Howard J.
Author_Institution :
Dept. of Comput. Sci., Regina Univ., Sask., Canada
fYear :
2000
fDate :
2000
Firstpage :
72
Lastpage :
81
Abstract :
An important problem in the area of data mining is the development of effective measures of interestingness for ranking discovered knowledge. The authors propose five principles that any measure must satisfy to be considered useful for ranking the interestingness of summaries generated from databases. We investigate the problem within the context of summarizing a single dataset which can be generalized in many different ways and to many levels of granularity. We perform a comparative sensitivity analysis of fifteen well-known diversity measures to identify those which satisfy the proposed principles. The fifteen diversity measures have previously been utilized in various disciplines, such as information theory, statistics, ecology, and economics. Their use as objective measures of interestingness for ranking summaries generated from databases is novel. The objective of this work is to gain some insight into the behaviour that can be expected from each of the diversity measures in practice, and to begin to develop a theory of interestingness against which the utility of new measures can be assessed
Keywords :
data mining; database theory; very large databases; comparative sensitivity analysis; data mining; database summaries; dataset; discovered knowledge ranking; diversity measures; ecology; economics; granularity levels; information theory; interestingness measures; objective measures of interestingness; summary mining; Area measurement; Computer science; Data mining; Databases; Environmental factors; Gain measurement; Information theory; Performance evaluation; Sensitivity analysis; Statistics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Tools with Artificial Intelligence, 2000. ICTAI 2000. Proceedings. 12th IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1082-3409
Print_ISBN :
0-7695-0909-6
Type :
conf
DOI :
10.1109/TAI.2000.889848
Filename :
889848
Link To Document :
بازگشت