• DocumentCode
    2358595
  • Title

    Discovering clusters in gene expression data using evolutionary approach

  • Author

    Ma, Patrick C H ; Chan, Keith C C

  • Author_Institution
    Dept. of Comput., Hong Kong Polytech. Univ., China
  • fYear
    2003
  • fDate
    3-5 Nov. 2003
  • Firstpage
    459
  • Lastpage
    466
  • Abstract
    The combined interpretation of gene expression data and gene sequences offers a valuable approach to investigate the intricate relationships involving gene transcriptional regulation. The highly interactive expression data produced by microarray hybridization experiments allow us to find the clusters of coexpressed genes. By analyzing the upstream regions of the identified coexpressed genes, we can discover the regulatory patterns characterized by transcription factor binding sites, which govern the process of transcriptional regulation. This paper presents a generic clustering algorithm that uses a Hybrid GA approach to discover clusters in gene expression data. The advantage of this method is that large search space can be effectively explored by utilizing the evolutionary algorithm techniques. Moreover, it is able to discover underlying patterns in noisy gene expression data for meaningful data groupings, and also statistically significant patterns hidden in each cluster can be extracted at the same time. Since the proposed method can handle both continuous-and discrete-valued data, it can be used with different microarray and biomedical data. To test its effectiveness, we have used it on real expression data. The experimental results reveal meaningful groupings and uncover many known transcription factor binding sites.
  • Keywords
    DNA; algorithm theory; biology computing; genetic algorithms; molecular biophysics; DNA; biomedical data; coexpressed genes; data grouping; evolutionary algorithm; gene clusters; gene expression data; gene sequences; gene transcriptional regulation; generic clustering algorithm; hybrid GA; microarray hybridization; Clustering algorithms; DNA; Data mining; Evolutionary computation; Gene expression; Genetics; Partitioning algorithms; Pattern analysis; Sequences; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Tools with Artificial Intelligence, 2003. Proceedings. 15th IEEE International Conference on
  • ISSN
    1082-3409
  • Print_ISBN
    0-7695-2038-3
  • Type

    conf

  • DOI
    10.1109/TAI.2003.1250225
  • Filename
    1250225