DocumentCode
2157948
Title
On Mining Micro-array data by Order-Preserving Submatrix
Author
Lin Cheung ; Yip, Kevin Y. ; Cheung, David W. ; Kao, B. ; Ng, Michael K.
Author_Institution
The University of Hong Kong, Hong Kong
fYear
2005
fDate
05-08 April 2005
Firstpage
1153
Lastpage
1153
Abstract
We study the problem of pattern-based subspace clustering. Unlike traditional clustering methods that focus on grouping objects with similar values on a set of dimensions, clustering by pattern similarity finds objects that exhibit a coherent pattern of rises and falls in subspaces. Applications of pattern-based subspace clustering include DNA micro-array data analysis, automatic recommendation systems and target marketing systems. Our goal is to devise pattern-based clustering methods that are capable of (1) discovering useful patterns of various shapes, and (2) discovering all significant patterns. We argue that previous solutions in pattern-based subspace clustering do not satisfy both requirements. Our approach is to extend the idea of Order-Preserving Submatrix (or OPSM). We devise a novel algorithm for mining OPSM, show that OPSM can be generalized to cover most existing pattern-based clustering models, and propose a number of extension to the original OPSM model.
Keywords
Data mining; Gene Expression; Patternbased clustering; Artificial intelligence; Bioinformatics; Clustering algorithms; Clustering methods; Computer science; DNA; Data analysis; Data mining; Gene expression; Shape; Data mining; Gene Expression; Patternbased clustering;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering Workshops, 2005. 21st International Conference on
Print_ISBN
0-7695-2657-8
Type
conf
DOI
10.1109/ICDE.2005.253
Filename
1647756
Link To Document