• DocumentCode
    2334192
  • Title

    Significance tests for patterns in continuous data

  • Author

    Bolton, Richard J. ; Hand, David J.

  • Author_Institution
    Dept. of Math., Imperial Coll., London, UK
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    67
  • Lastpage
    74
  • Abstract
    The authors consider the question of uncertainty of detected patterns in data mining. In particular, we develop statistical tests for patterns found in continuous data, indicating the significance of these patterns in terms of the probability that they have occurred by chance. We examine the performance of these tests on patterns detected in several large data sets, including a data set describing the locations of earthquakes in California and another describing flow cytometry measurements on phytoplankton
  • Keywords
    data mining; statistical analysis; uncertainty handling; very large databases; California; continuous data patterns; data mining; detected pattern uncertainty; earthquake locations; flow cytometry measurements; large data sets; phytoplankton; probability; significance tests; statistical tests; Buildings; Data mining; Earthquakes; Educational institutions; Mathematics; Pattern analysis; Probability; Sequences; Testing; Uncertainty;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference on
  • Conference_Location
    San Jose, CA
  • Print_ISBN
    0-7695-1119-8
  • Type

    conf

  • DOI
    10.1109/ICDM.2001.989502
  • Filename
    989502