• DocumentCode
    2157948
  • Title

    On Mining Micro-array data by Order-Preserving Submatrix

  • Author

    Lin Cheung ; Yip, Kevin Y. ; Cheung, David W. ; Kao, B. ; Ng, Michael K.

  • Author_Institution
    The University of Hong Kong, Hong Kong
  • fYear
    2005
  • fDate
    05-08 April 2005
  • Firstpage
    1153
  • Lastpage
    1153
  • Abstract
    We study the problem of pattern-based subspace clustering. Unlike traditional clustering methods that focus on grouping objects with similar values on a set of dimensions, clustering by pattern similarity finds objects that exhibit a coherent pattern of rises and falls in subspaces. Applications of pattern-based subspace clustering include DNA micro-array data analysis, automatic recommendation systems and target marketing systems. Our goal is to devise pattern-based clustering methods that are capable of (1) discovering useful patterns of various shapes, and (2) discovering all significant patterns. We argue that previous solutions in pattern-based subspace clustering do not satisfy both requirements. Our approach is to extend the idea of Order-Preserving Submatrix (or OPSM). We devise a novel algorithm for mining OPSM, show that OPSM can be generalized to cover most existing pattern-based clustering models, and propose a number of extension to the original OPSM model.
  • Keywords
    Data mining; Gene Expression; Patternbased clustering; Artificial intelligence; Bioinformatics; Clustering algorithms; Clustering methods; Computer science; DNA; Data analysis; Data mining; Gene expression; Shape; Data mining; Gene Expression; Patternbased clustering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering Workshops, 2005. 21st International Conference on
  • Print_ISBN
    0-7695-2657-8
  • Type

    conf

  • DOI
    10.1109/ICDE.2005.253
  • Filename
    1647756