Title :
Self-supervised learning for gene classification on microarray data
Author :
Lu, Yijuan ; Tian, Qi ; Sanchez, Maribel ; Wang, Yufeng
Author_Institution :
Dept. of Comput. Sci., Univ. of Texas at San Antonio, San Antonio, TX
Abstract :
With the development of microarray technology, it provides massive amounts of high dimensional gene expression data simultaneously and most of their functions are unknown. Computational methods that can effectively resolve high dimensionality and small sample size problems for the high throughput data are valuable in systems biology. Self- supervised learning techniques, which take a hybrid of labeled and unlabeled data to train classifiers, can solve the problem efficiently. Discriminant-EM (DEM) proposes a framework for such tasks by applying self-supervised learning in an optimal discriminating subspace of the original feature space. In this paper, the linear algorithm is extended to a nonlinear kernel algorithm to capture the non-linearity in the data distribution. Extensive experiments on the Plasmodium falciparum dataset show the promising performance of the approach.
Keywords :
biology computing; expectation-maximisation algorithm; genetics; learning (artificial intelligence); pattern classification; DEM; discriminant-expectation maximization; gene expression data classification; linear algorithm; microarray data; nonlinear kernel algorithm; self-supervised learning technique; systems biology; Biology computing; Classification algorithms; Computer science; Databases; Gene expression; Kernel; Simultaneous localization and mapping; Systems biology; Throughput; Training data;
Conference_Titel :
Genomic Signal Processing and Statistics, 2006. GENSIPS '06. IEEE International Workshop on
Conference_Location :
College Station, TX
Print_ISBN :
1-4244-0384-7
Electronic_ISBN :
1-4244-0385-5
DOI :
10.1109/GENSIPS.2006.353178