Title :
Tumor Clustering Using Nonnegative Matrix Factorization With Gene Selection
Author :
Zheng, Chun-Hou ; Huang, De-Shuang ; Zhang, Lei ; Kong, Xiang-Zhen
fDate :
7/1/2009 12:00:00 AM
Abstract :
Tumor clustering is becoming a powerful method in cancer class discovery. Nonnegative matrix factorization (NMF) has shown advantages over other conventional clustering techniques. Nonetheless, there is still considerable room for improving the performance of NMF. To this end, in this paper, gene selection and explicitly enforcing sparseness are introduced into the factorization process. Particularly, independent component analysis is employed to select a subset of genes so that the effect of irrelevant or noisy genes can be reduced. The NMF and its extensions, sparse NMF and NMF with sparseness constraint, are then used for tumor clustering on the selected genes. A series of elaborate experiments are performed by varying the number of clusters and the number of selected genes to evaluate the cooperation between different gene selection settings and NMF-based clustering. Finally, the experiments on three representative gene expression datasets demonstrated that the proposed scheme can achieve better clustering results.
Keywords :
cancer; genetics; independent component analysis; matrix decomposition; medical diagnostic computing; tumours; cancer class discovery; gene expression; gene selection; independent component analysis; nonnegative matrix factorization; tumor clustering; Clustering; gene expression data; independent component analysis (ICA); nonnegative matrix factorization (NMF); tumor; Algorithms; Cluster Analysis; Databases, Genetic; Gene Expression; Humans; Models, Genetic; Neoplasms; Principal Component Analysis;
Journal_Title :
Information Technology in Biomedicine, IEEE Transactions on
DOI :
10.1109/TITB.2009.2018115