• DocumentCode
    2460120
  • Title

    Gene Expression Data Classification Using Locality Preserving Projections

  • Author

    Bian, Houqin ; Chung, Ronald

  • Author_Institution
    Inst. of Comput. & Inf. Eng., Shanghai Univ. of Electr. Power, Shanghai, China
  • fYear
    2011
  • fDate
    24-26 Oct. 2011
  • Firstpage
    66
  • Lastpage
    71
  • Abstract
    Classification analysis of gene expression data could lead to knowledge of gene functions and diseases mechanisms. However, the data involve nonlinear interactions among genes and environmental factors. Worst yet, while the data are usually of high dimensions, the sample sizes acquirable are generally relatively small, resulting in the well known difficulty - the curse of dimensionality - in the classification task. This work describes how gene expression data can be analyzed using Locality Preserving Projections (LPP) manifold learning method. LPP is a dimensionality reduction strategy for feature selection and visualization. Using LPP, the high dimensional gene expression data are mapped to a low dimensional subspace for data analysis. LPP finds the optimal linear approximations to the eigenfunctions of the Laplace Beltrami operator on the manifold. Not only does it share many convenient data-representation properties of the nonlinear techniques like Laplacian Eigenmaps or Locally Linear Embedding, it is also linear and more crucially is defined everywhere in the ambient space rather than just on the training data points. Comparative experimental results with PC A, LDA, LLE, etc. on different gene expression datasets show that the LPP-based method has the potential of being more efficient for complex gene expression data classification.
  • Keywords
    Laplace equations; bioinformatics; eigenvalues and eigenfunctions; genetic algorithms; genetics; Laplace Beltrami operator; Laplacian eigenmaps; data classification; data-representation properties; dimensionality reduction strategy; disease mechanisms; eigenfunctions; environmental factors; feature selection; feature visualization; gene expression; gene functions; locality preserving projections; locally linear embedding; low-dimensional subspace; nonlinear interactions; optimal linear approximations; Accuracy; Eigenvalues and eigenfunctions; Feature extraction; Gene expression; Manifolds; Principal component analysis; Symmetric matrices; classification; local preserving projection; microarray data analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Bioengineering (BIBE), 2011 IEEE 11th International Conference on
  • Conference_Location
    Taichung
  • Print_ISBN
    978-1-61284-975-1
  • Type

    conf

  • DOI
    10.1109/BIBE.2011.17
  • Filename
    6089809