• DocumentCode
    1945975
  • Title

    Clustering, Assessment and Validation: an application to gene expression data

  • Author

    Ciaramella, A. ; Cocozza, S. ; Iorio, F. ; Miele, G. ; Napolitano, F. ; Pinelli, M. ; Raiconi, G. ; Tagliaferri, R.

  • Author_Institution
    DMI, Salerno Univ., Salerno
  • fYear
    2007
  • fDate
    12-17 Aug. 2007
  • Firstpage
    1613
  • Lastpage
    1618
  • Abstract
    In this work a multi-step approach for clustering assessment, visualization and data validation is introduced. Three main approaches for data clustering are used and compared: K-means, self organizing maps and probabilistic principal surfaces. A model explorer approach with different similarity measures is used to obtain the best parameters of the methods. The approach is used to identify genes periodically expressed in tumors related to the human cell cycle. Finally, clusters are validated by using GO term information.
  • Keywords
    biology computing; data visualisation; genetics; pattern clustering; tumours; GO Term information; K-means clustering; data assessment; data clustering; data validation; data visualization; gene expression data; human cell cycle; model explorer approach; probabilistic principal surfaces; self organizing maps; tumors; Bioinformatics; Cells (biology); Data mining; Data visualization; Gene expression; Genetics; Genomics; Humans; Independent component analysis; Principal component analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2007. IJCNN 2007. International Joint Conference on
  • Conference_Location
    Orlando, FL
  • ISSN
    1098-7576
  • Print_ISBN
    978-1-4244-1379-9
  • Electronic_ISBN
    1098-7576
  • Type

    conf

  • DOI
    10.1109/IJCNN.2007.4371199
  • Filename
    4371199