• DocumentCode
    2723246
  • Title

    Discovering Connected Patterns in Gene Expression Arrays

  • Author

    Yousri, Noha A. ; Ismail, Mohamed A. ; Kamel, Mohamed S.

  • Author_Institution
    IEEE Comput. & Syst. Eng., Alexandria Univ. of Alexandria
  • fYear
    2007
  • fDate
    1-5 April 2007
  • Firstpage
    113
  • Lastpage
    120
  • Abstract
    Clustering methods have been extensively used for gene expression data analysis to detect groups of related genes. The clusters provide useful information to analyze gene function, gene regulation and cellular patterns. Most existing clustering algorithms, though, discover only coherent gene expression patterns, and do not handle connected patterns. Coherent and connected patterns correspond to globular and arbitrary shaped clusters, respectively, in low dimensional spaces. For high dimensional gene expression data, two connected patterns can be two similar patterns with time lags in a time series data, or in general, two different patterns that are connected by an intermediate pattern that is related to both of them. Discovering such connected patterns has important biological implications not revealed by groups of coherent patterns. In this paper, a novel algorithm that finds connected patterns, in gene expression data, is proposed. Using a novel merge criterion, it can distinguish clusters based on distances between patterns, thus avoiding the effect of noise and outliers. Moreover, the algorithm uses a metric based on Pearson correlation to find neighbours, which renders it a lower complexity than related algorithms. Both time series and non temporal gene expression data sets are used to illustrate the efficiency of the proposed algorithm. Results on the serum and the leukaemia data sets reveal interesting biologically significant information
  • Keywords
    biology computing; data analysis; data mining; genetics; pattern clustering; cellular patterns; clustering method; connected pattern discovery; gene expression arrays; gene expression data analysis; gene function; gene regulation; Algorithm design and analysis; Bioinformatics; Biology computing; Clustering algorithms; Clustering methods; Computational biology; Computational intelligence; Couplings; Gene expression; Pattern analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Bioinformatics and Computational Biology, 2007. CIBCB '07. IEEE Symposium on
  • Conference_Location
    Honolulu, HI
  • Print_ISBN
    1-4244-0710-9
  • Type

    conf

  • DOI
    10.1109/CIBCB.2007.4221212
  • Filename
    4221212