DocumentCode
2442836
Title
Self-supervised learning for gene classification on microarray data
Author
Lu, Yijuan ; Tian, Qi ; Sanchez, Maribel ; Wang, Yufeng
Author_Institution
Dept. of Comput. Sci., Univ. of Texas at San Antonio, San Antonio, TX
fYear
2006
fDate
28-30 May 2006
Firstpage
105
Lastpage
106
Abstract
With the development of microarray technology, it provides massive amounts of high dimensional gene expression data simultaneously and most of their functions are unknown. Computational methods that can effectively resolve high dimensionality and small sample size problems for the high throughput data are valuable in systems biology. Self- supervised learning techniques, which take a hybrid of labeled and unlabeled data to train classifiers, can solve the problem efficiently. Discriminant-EM (DEM) proposes a framework for such tasks by applying self-supervised learning in an optimal discriminating subspace of the original feature space. In this paper, the linear algorithm is extended to a nonlinear kernel algorithm to capture the non-linearity in the data distribution. Extensive experiments on the Plasmodium falciparum dataset show the promising performance of the approach.
Keywords
biology computing; expectation-maximisation algorithm; genetics; learning (artificial intelligence); pattern classification; DEM; discriminant-expectation maximization; gene expression data classification; linear algorithm; microarray data; nonlinear kernel algorithm; self-supervised learning technique; systems biology; Biology computing; Classification algorithms; Computer science; Databases; Gene expression; Kernel; Simultaneous localization and mapping; Systems biology; Throughput; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Genomic Signal Processing and Statistics, 2006. GENSIPS '06. IEEE International Workshop on
Conference_Location
College Station, TX
Print_ISBN
1-4244-0384-7
Electronic_ISBN
1-4244-0385-5
Type
conf
DOI
10.1109/GENSIPS.2006.353178
Filename
4161799
Link To Document