• DocumentCode
    3230827
  • Title

    A framework for high dimensional data reduction in the microarray domain

  • Author

    Anaissi, Ali ; Kennedy, Paul J. ; Goyal, Madhu

  • Author_Institution
    Fac. of Eng. & Inf. Technol. (FElT), Univ. of Technol., Broadway, NSW, Australia
  • fYear
    2010
  • fDate
    23-26 Sept. 2010
  • Firstpage
    903
  • Lastpage
    907
  • Abstract
    Microarray analysis and visualization is very helpful for biologists and clinicians to understand gene expression in cells and to facilitate diagnosis and treatment of patients. However, a typical microarray dataset has thousands of features and a very small number of observations. This very high dimensional data has a massive amount of information which often contains some noise, non-useful information and small number of relevant features for disease or genotype. This paper proposes a framework for very high dimensional data reduction based on three technologies: feature selection, linear dimensionality reduction and non-linear dimensionality reduction. In this paper, feature selection based on mutual information will be proposed for filtering features and selecting the most relevant features with the minimum redundancy. A kernel linear dimensionality reduction method is also used to extract the latent variables from a high dimensional data set. In addition, a non-linear dimensionality reduction based on local linear embedding is used to reduce the dimension and visualize the data. Experimental results are presented to show the outputs of each step and the efficiency of this framework.
  • Keywords
    data analysis; data reduction; data visualization; disease; feature selection; gene expression; genotype; kernel linear dimensionality reduction; microarray analysis; microarray dataset; microarray domain; mutual information; nonlinear dimensionality reduction; patient diagnosis; patient treatment; very high dimensional data reduction; Australia; Visualization; Feature Selection; Linear dimension Reduction; Non-Linear Dimension Reduction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bio-Inspired Computing: Theories and Applications (BIC-TA), 2010 IEEE Fifth International Conference on
  • Conference_Location
    Changsha
  • Print_ISBN
    978-1-4244-6437-1
  • Type

    conf

  • DOI
    10.1109/BICTA.2010.5645247
  • Filename
    5645247