• DocumentCode
    182989
  • Title

    A common-subsequence-based approach for mining deep order preserving submatrix

  • Author

    Yun Xue ; Tiechen Li ; Zhiwen Liu ; Zhengling Liao ; Hua Xiao ; Hongya Zhao ; Xiaohui Hu

  • Author_Institution
    Sch. of Phys. & Telecommun. Eng., South China Normal Univ., Guangzhou, China
  • fYear
    2014
  • fDate
    19-21 Aug. 2014
  • Firstpage
    334
  • Lastpage
    340
  • Abstract
    As an effective biclustering model, order-preserving submatrix (OPSM) has been widely applied to biological gene expression data mining. Recently, biologists hope to find deep OPSMs with long patterns and comparatively few support rows, which are not only useful on the interpretation of gene regulatory networks but also have essential biological significance. Unfortunately, the traditional exact mining algorithms based on Apriori principle could not deal with the deep OPSM problem, since they often take a large minimum support threshold for pattern pruning, and inevitably miss some significant deep OPSMs. Therefore, this paper proposes a new exact algorithm for mining deep OPSMs, which obtain all the deep OPSMs by finding the common subsequences shared by every two rows. Experiments have been done in both real and synthetic data sets, and the results show that our algorithm is suitable for the full mining of deep OPSMs with a small support, which could even find all the deep OPSMs with a minimum support threshold of 2. Compared with the traditional sequential pattern mining algorithms which depend on relatively large support threshold, our algorithm is an effective one to solve the deep OPSM problem.
  • Keywords
    biology computing; data mining; genetics; pattern clustering; OPSM; biclustering model; biological gene expression data mining; biological significance; common-subsequence-based approach; deep order preserving submatrix mining; gene regulatory networks; Algorithm design and analysis; Biological system modeling; Data mining; Gene expression; Indexes; biclustering; common subsequence; deep order preserving submatrix;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery (FSKD), 2014 11th International Conference on
  • Conference_Location
    Xiamen
  • Print_ISBN
    978-1-4799-5147-5
  • Type

    conf

  • DOI
    10.1109/FSKD.2014.6980856
  • Filename
    6980856