• DocumentCode
    3105629
  • Title

    The PDD Framework for Detecting Categories of Peculiar Data

  • Author

    Shrestha, Mahesh ; Hamilton, Howard J. ; Yao, Yiyu ; Konkel, Ken ; Geng, Liqiang

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Regina, Regina, SK
  • fYear
    2006
  • fDate
    18-22 Dec. 2006
  • Firstpage
    562
  • Lastpage
    571
  • Abstract
    Peculiar data are objects that are relatively few in number and significantly different from the other objects in a data set. In this paper, we propose the PDD framework for detecting multiple categories of peculiar data. This framework provides an extensible set of perspectives for viewing data, currently including viewing data as a set of records, attributes, frequencies, intervals, sequences, or sequences of changes. By using these six views of the data, multiple categories of peculiar data can be detected to reveal different aspects of the data. For each view, the framework provides an extensible set of peculiarity measures to detect outliers and other kinds of peculiar data. The PDD framework has been implemented for Oracle and Access. Experiments are reported for data sets concerning Regina weather and NHL hockey.
  • Keywords
    data analysis; data mining; database management systems; Access; NHL hockey; Oracle; PDD framework; Regina weather; data aspects; outlier detection; peculiar data category detection; peculiarity measure; Cancer; Cleaning; Computer science; Density measurement; Event detection; Frequency; Intrusion detection; Medical diagnosis; Object detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2006. ICDM '06. Sixth International Conference on
  • Conference_Location
    Hong Kong
  • ISSN
    1550-4786
  • Print_ISBN
    0-7695-2701-7
  • Type

    conf

  • DOI
    10.1109/ICDM.2006.159
  • Filename
    4053082