• DocumentCode
    2442836
  • Title

    Self-supervised learning for gene classification on microarray data

  • Author

    Lu, Yijuan ; Tian, Qi ; Sanchez, Maribel ; Wang, Yufeng

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Texas at San Antonio, San Antonio, TX
  • fYear
    2006
  • fDate
    28-30 May 2006
  • Firstpage
    105
  • Lastpage
    106
  • Abstract
    With the development of microarray technology, it provides massive amounts of high dimensional gene expression data simultaneously and most of their functions are unknown. Computational methods that can effectively resolve high dimensionality and small sample size problems for the high throughput data are valuable in systems biology. Self- supervised learning techniques, which take a hybrid of labeled and unlabeled data to train classifiers, can solve the problem efficiently. Discriminant-EM (DEM) proposes a framework for such tasks by applying self-supervised learning in an optimal discriminating subspace of the original feature space. In this paper, the linear algorithm is extended to a nonlinear kernel algorithm to capture the non-linearity in the data distribution. Extensive experiments on the Plasmodium falciparum dataset show the promising performance of the approach.
  • Keywords
    biology computing; expectation-maximisation algorithm; genetics; learning (artificial intelligence); pattern classification; DEM; discriminant-expectation maximization; gene expression data classification; linear algorithm; microarray data; nonlinear kernel algorithm; self-supervised learning technique; systems biology; Biology computing; Classification algorithms; Computer science; Databases; Gene expression; Kernel; Simultaneous localization and mapping; Systems biology; Throughput; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Genomic Signal Processing and Statistics, 2006. GENSIPS '06. IEEE International Workshop on
  • Conference_Location
    College Station, TX
  • Print_ISBN
    1-4244-0384-7
  • Electronic_ISBN
    1-4244-0385-5
  • Type

    conf

  • DOI
    10.1109/GENSIPS.2006.353178
  • Filename
    4161799