DocumentCode
2334192
Title
Significance tests for patterns in continuous data
Author
Bolton, Richard J. ; Hand, David J.
Author_Institution
Dept. of Math., Imperial Coll., London, UK
fYear
2001
fDate
2001
Firstpage
67
Lastpage
74
Abstract
The authors consider the question of uncertainty of detected patterns in data mining. In particular, we develop statistical tests for patterns found in continuous data, indicating the significance of these patterns in terms of the probability that they have occurred by chance. We examine the performance of these tests on patterns detected in several large data sets, including a data set describing the locations of earthquakes in California and another describing flow cytometry measurements on phytoplankton
Keywords
data mining; statistical analysis; uncertainty handling; very large databases; California; continuous data patterns; data mining; detected pattern uncertainty; earthquake locations; flow cytometry measurements; large data sets; phytoplankton; probability; significance tests; statistical tests; Buildings; Data mining; Earthquakes; Educational institutions; Mathematics; Pattern analysis; Probability; Sequences; Testing; Uncertainty;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference on
Conference_Location
San Jose, CA
Print_ISBN
0-7695-1119-8
Type
conf
DOI
10.1109/ICDM.2001.989502
Filename
989502
Link To Document