DocumentCode
3105629
Title
The PDD Framework for Detecting Categories of Peculiar Data
Author
Shrestha, Mahesh ; Hamilton, Howard J. ; Yao, Yiyu ; Konkel, Ken ; Geng, Liqiang
Author_Institution
Dept. of Comput. Sci., Univ. of Regina, Regina, SK
fYear
2006
fDate
18-22 Dec. 2006
Firstpage
562
Lastpage
571
Abstract
Peculiar data are objects that are relatively few in number and significantly different from the other objects in a data set. In this paper, we propose the PDD framework for detecting multiple categories of peculiar data. This framework provides an extensible set of perspectives for viewing data, currently including viewing data as a set of records, attributes, frequencies, intervals, sequences, or sequences of changes. By using these six views of the data, multiple categories of peculiar data can be detected to reveal different aspects of the data. For each view, the framework provides an extensible set of peculiarity measures to detect outliers and other kinds of peculiar data. The PDD framework has been implemented for Oracle and Access. Experiments are reported for data sets concerning Regina weather and NHL hockey.
Keywords
data analysis; data mining; database management systems; Access; NHL hockey; Oracle; PDD framework; Regina weather; data aspects; outlier detection; peculiar data category detection; peculiarity measure; Cancer; Cleaning; Computer science; Density measurement; Event detection; Frequency; Intrusion detection; Medical diagnosis; Object detection;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2006. ICDM '06. Sixth International Conference on
Conference_Location
Hong Kong
ISSN
1550-4786
Print_ISBN
0-7695-2701-7
Type
conf
DOI
10.1109/ICDM.2006.159
Filename
4053082
Link To Document