• DocumentCode
    527420
  • Title

    A semi-supervised style method for haplotype assembly problem based on MEC model

  • Author

    Xu, Xinshun

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Shandong Univ., Jinan, China
  • Volume
    3
  • fYear
    2010
  • fDate
    10-12 Aug. 2010
  • Firstpage
    1508
  • Lastpage
    1512
  • Abstract
    Haplotype reconstruction based on aligned SNP fragments is to infer a pair of haplotypes from localized polymorphism data got through short genome fragments. For this problem, the minimum error correction (MEC) model is one of important computational models. This model constructs a pair of haplotypes by correcting minimum SNPs in genome fragments of an individual´s DNA. In this paper, a semi-supervised competitive neural network on the MEC model is proposed. This algorithm aims at clustering all fragments into two sets. The fragments in each set can then be used to construct a haplotype with minimum SNPs corrected. Although the architecture of the proposed method is simple, it outperforms other two algorithms on most instances of both real data and simulation data. So, the results show that the proposed semi-supervised neutral network is effective. The results also show that semi-supervised algorithm is feasible and promising for this problem.
  • Keywords
    bioinformatics; learning (artificial intelligence); neural nets; MEC model; SNP fragments; haplotype assembly problem; haplotype reconstruction; localized polymorphism data; minimum error correction model; semisupervised competitive neural network; semisupervised style method; short genome fragments; Artificial neural networks; Assembly; Bioinformatics; Data models; Genomics; Mathematical model; Neurons; bioinformatics; haplotype assembly; minimum error correction; neural network; semi-supervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Computation (ICNC), 2010 Sixth International Conference on
  • Conference_Location
    Yantai, Shandong
  • Print_ISBN
    978-1-4244-5958-2
  • Type

    conf

  • DOI
    10.1109/ICNC.2010.5582649
  • Filename
    5582649