• DocumentCode
    2414003
  • Title

    Exploring matrix factorization techniques for significant genes identification of microarray dataset

  • Author

    Kong, Wei ; Mou, Xiaoyang ; Hu, Xiaohua

  • Author_Institution
    Inf. Eng. Coll., Shanghai Maritime Univ., Shanghai, China
  • fYear
    2010
  • fDate
    18-21 Dec. 2010
  • Firstpage
    401
  • Lastpage
    405
  • Abstract
    Unsupervised machine learning approaches are efficient analysis tools for DNA microarray technique which can accumulate hundreds of thousands of genes expression levels in a single experiment. In our study, two unsupervised knowledge-based matrix factorization methods, independent component analysis (ICA) and nonnegative matrix factorization (NMF) are explored to identify significant genes and related pathways in microarray gene expression dataset. The advantage of these two approaches is they can be performed as a biclustering method by which genes and conditions can be clustered simultaneously. Furthermore, they can group genes into different categories for identifying related diagnostic pathways and regulatory networks. The difference between these two method lies in ICA assume statistical independence of the expression modes, while NMF need positivity constrains to generate localized gene expression profiles. By combining the significant genes identified by both ICA and NMF, the simulation results show great efficient for finding underlying biological processes and related pathways in Alzheimer´s disease (AD) and the activation patterns to AD phenotypes.
  • Keywords
    cellular biophysics; diseases; genetics; independent component analysis; lab-on-a-chip; matrix decomposition; medical diagnostic computing; molecular biophysics; unsupervised learning; Alzheimer disease; DNA microarray; ICA; activation patterns; biclustering method; gene identification; independent component analysis; localized gene expression profiles; microarray dataset; nonnegative matrix factorization; regulatory networks; related diagnostic pathways; unsupervised knowledge; unsupervised machine learning; Cancer; Encoding; Genetic expression; Independent component analysis; Alzheimer´s disease (AD); Biclustering; Independent component analysis (ICA); Nonnegative matrix factorization (NMF); matrix factorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine (BIBM), 2010 IEEE International Conference on
  • Conference_Location
    Hong Kong
  • Print_ISBN
    978-1-4244-8306-8
  • Electronic_ISBN
    978-1-4244-8307-5
  • Type

    conf

  • DOI
    10.1109/BIBM.2010.5706599
  • Filename
    5706599