مرکز منطقه ای اطلاع رساني علوم و فناوري - K-means+ method for improving gene selection for classification of microarray data

DocumentCode :

2527537

Title :

K-means+ method for improving gene selection for classification of microarray data

Author :

Huang, Heng ; Zhang, Rong ; Xiong, Fei ; Makedon, Fillia ; Shen, Li ; Hettleman, Bruce ; Pearlman, Justin

Author_Institution :

Dept. of Comput. Sci., Dartmouth Coll., Hanover, NH, USA

fYear :

2005

fDate :

8-11 Aug. 2005

Firstpage :

110

Lastpage :

111

Abstract :

Microarray gene expression techniques have recently made it possible to offer phenotype classification of many diseases. One problem in this analysis is that each sample is represented by quite a large number of genes, and many of them are insignificant or redundant to clarify the disease problem. The previous work has shown that selecting informative genes from microarray data can improve the accuracy of classification. Clustering methods have been successfully applied to group similar genes and select informative genes from them to avoid redundancy and extract biological information from them. A problem with these approaches is that the number of clusters must be given and it is time-consuming to fry all possible numbers for clusters. In this paper, a heuristic, called K-means+, is used to address the number of clusters dependency and degeneracy problems. The result of our experiments shows that K-means+ method can automatically partition genes into a reasonable number of clusters and then the informative genes are selected from clusters.

Keywords :

biology computing; diseases; genetics; information retrieval systems; pattern classification; pattern clustering; biological information extraction; clustering method; diseases; heuristic; informative genes; microarray data classification; microarray gene expression technique; phenotype classification; Cardiology; Chebyshev approximation; Computer science; Data analysis; Distributed computing; Educational institutions; Gaussian distribution; Gene expression; Information science; Statistical distributions;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computational Systems Bioinformatics Conference, 2005. Workshops and Poster Abstracts. IEEE

Print_ISBN :

0-7695-2442-7

Type :

conf

DOI :

10.1109/CSBW.2005.82

Filename :

1540562

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2527537