• DocumentCode
    3074077
  • Title

    Tri-Cluster-Tri-Scheme-Training: Exploiting Unlabeled Data for Transmembrane Segments Prediction

  • Author

    He, Jieyue ; Harrison, R. ; Tai, Phang C. ; Pan, Yi

  • fYear
    2009
  • fDate
    22-24 June 2009
  • Firstpage
    168
  • Lastpage
    175
  • Abstract
    Recent work using supervised learning for protein structure prediction has achieved state-of-the-art classification performance. However, such methods are based only on labeled data, while in practice the labeled data is so few and expensive to obtain and unlabeled data is far more plentiful. An effective way to enhance the performance of the learned hypothesis by using the labeled and unlabeled data together is known as semi-supervised learning. Although there are lots of semi-supervised learning methods, those approaches could not always achieve the acceptable results for bioinformatics application, especially when there is only very few labeled instances. Therefore, in this paper, we present a novel, more effective method tri-cluster-tri-scheme-training (TCTS) which firstly uses tri-cluster to label some high confidence unlabeled instances and then refines the classifiers by utilizing both of the label data and unlabeled data in the Tri-Scheme-training by different schemes. The encouraging experimental results indicate that TCTS algorithm opens a new way to solve the complex classification problem when very few labeled datasets are available.
  • Keywords
    bioinformatics; learning (artificial intelligence); molecular biophysics; pattern classification; pattern clustering; proteins; bioinformatics application; classifier; semi-supervised learning; transmembrane segments prediction; tri-cluster-tri-scheme-training; tri-scheme-training; unlabeled data; Bioinformatics; Biomembranes; Computer science; Labeling; Nuclear magnetic resonance; Parameter estimation; Proteins; Semisupervised learning; Sequences; USA Councils; Transmembrane Segments Prediction; semi-supervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and BioEngineering, 2009. BIBE '09. Ninth IEEE International Conference on
  • Conference_Location
    Taichung
  • Print_ISBN
    978-0-7695-3656-9
  • Type

    conf

  • DOI
    10.1109/BIBE.2009.15
  • Filename
    5211288