DocumentCode
1945975
Title
Clustering, Assessment and Validation: an application to gene expression data
Author
Ciaramella, A. ; Cocozza, S. ; Iorio, F. ; Miele, G. ; Napolitano, F. ; Pinelli, M. ; Raiconi, G. ; Tagliaferri, R.
Author_Institution
DMI, Salerno Univ., Salerno
fYear
2007
fDate
12-17 Aug. 2007
Firstpage
1613
Lastpage
1618
Abstract
In this work a multi-step approach for clustering assessment, visualization and data validation is introduced. Three main approaches for data clustering are used and compared: K-means, self organizing maps and probabilistic principal surfaces. A model explorer approach with different similarity measures is used to obtain the best parameters of the methods. The approach is used to identify genes periodically expressed in tumors related to the human cell cycle. Finally, clusters are validated by using GO term information.
Keywords
biology computing; data visualisation; genetics; pattern clustering; tumours; GO Term information; K-means clustering; data assessment; data clustering; data validation; data visualization; gene expression data; human cell cycle; model explorer approach; probabilistic principal surfaces; self organizing maps; tumors; Bioinformatics; Cells (biology); Data mining; Data visualization; Gene expression; Genetics; Genomics; Humans; Independent component analysis; Principal component analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2007. IJCNN 2007. International Joint Conference on
Conference_Location
Orlando, FL
ISSN
1098-7576
Print_ISBN
978-1-4244-1379-9
Electronic_ISBN
1098-7576
Type
conf
DOI
10.1109/IJCNN.2007.4371199
Filename
4371199
Link To Document