DocumentCode :
3261353
Title :
NHOP: Detecting descriptive patterns using association pattern mining
Author :
Lui, Thomas W H ; Chiu, David K Y
Author_Institution :
Dept. of Comput. & Inf. Sci., Univ. of Guelph, Guelph, ON
fYear :
2008
fDate :
26-28 Aug. 2008
Firstpage :
491
Lastpage :
496
Abstract :
To facilitate interpretation and consider the relationships of internal interdependency between data values, a new form of high-order (multiple-valued) pattern known as Nested High-Order Pattern (NHOP) is presented. This pattern satisfies a consistent statistical criterion when the pattern is iteratively extracted. The general form of High-Order Pattern (HOP), that NHOP is a subtype, is a set of multiple associated values (identified as variable outcomes) extracted from a random N-tuple. The pattern is detected by statistical testing if the occurrence is significantly deviated from the expected according to a prior model or null hypothesis. Here we extend our work of NHOP to classification and clustering tasks to identify a clearer description of the patterns. The rationale is that, complex association patterns reinforce the underlying meaningfulness in interpreting the regularity, thus can provide a better understanding of the classification and data domain. We develop a search strategy, called NHOP-Covering algorithm, to detect a set of NHOP. Experiments on benchmark machine learning data and real-world biomolecular data are evaluated with promising results.
Keywords :
data mining; pattern classification; pattern clustering; association pattern mining; clustering tasks; complex association patterns; data classification; machine learning data; multiple associated values; nested high-order pattern; statistical testing; Bioinformatics; Classification tree analysis; Clustering algorithms; Data mining; Frequency estimation; Information science; Machine learning; Machine learning algorithms; Statistical analysis; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Granular Computing, 2008. GrC 2008. IEEE International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4244-2512-9
Electronic_ISBN :
978-1-4244-2513-6
Type :
conf
DOI :
10.1109/GRC.2008.4664672
Filename :
4664672
Link To Document :
بازگشت