• DocumentCode
    3496882
  • Title

    Comparative study on dimension reduction techniques for cluster analysis of microarray data

  • Author

    Araújo, Daniel ; Neto, Adrião Dória ; Martins, Allan ; Melo, Jorge

  • Author_Institution
    Dept. of Comput. & Autom., Fed. Univ. of Rio Grande do Norte, Natal, Brazil
  • fYear
    2011
  • fDate
    July 31 2011-Aug. 5 2011
  • Firstpage
    1835
  • Lastpage
    1842
  • Abstract
    This paper proposes a study on the impact of the use of dimension reduction techniques (DRTs) in the quality of partitions produced by cluster analysis of microarray datasets. We tested seven DRTs applied to four microarray cancer datasets and ran four clustering algorithms using the original and reduced datasets. Overall results showed that using DRTs provides a improvement in performance of all algorithms tested, specially in the hierarchical class. We could see that, despite Principal Component Analysis (PCA) being the most widely used DRT, its was overcome by other nonlinear methods and it did not provide a substantial performance increase in the clustering algorithms. On the other hand, t-distributed Stochastic Embedding (t-SNE) and Laplacian Eigenmaps (LE) achieved good results for all datasets.
  • Keywords
    pattern clustering; principal component analysis; statistical distributions; Laplacian eigenmaps; data cluster analysis; dimension reduction technique; microarray cancer dataset; principal component analysis; t-distributed stochastic embedding; Algorithm design and analysis; Cancer; Clustering algorithms; Indexes; Kernel; Partitioning algorithms; Principal component analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), The 2011 International Joint Conference on
  • Conference_Location
    San Jose, CA
  • ISSN
    2161-4393
  • Print_ISBN
    978-1-4244-9635-8
  • Type

    conf

  • DOI
    10.1109/IJCNN.2011.6033447
  • Filename
    6033447