• DocumentCode
    1392064
  • Title

    A Preprocessing Procedure for Haplotype Inference by Pure Parsimony

  • Author

    Irurozki, Ekhine ; Calvo, Borja ; Lozano, Jose A.

  • Author_Institution
    Dept. of Comput. Sci. & Artificial Intell., Univ. of the Basque Country, Donostia, Spain
  • Volume
    8
  • Issue
    5
  • fYear
    2011
  • Firstpage
    1183
  • Lastpage
    1195
  • Abstract
    Haplotype data are especially important in the study of complex diseases since it contains more information than genotype data. However, obtaining haplotype data is technically difficult and costly. Computational methods have proved to be an effective way of inferring haplotype data from genotype data. One of these methods, the haplotype inference by pure parsimony approach (HIPP), casts the problem as an optimization problem and as such has been proved to be NP-hard. We have designed and developed a new preprocessing procedure for this problem. Our proposed algorithm works with groups of haplotypes rather than individual haplotypes. It iterates searching and deleting haplotypes that are not helpful in order to find the optimal solution. This preprocess can be coupled with any of the current solvers for the HIPP that need to preprocess the genotype data. In order to test it, we have used two state-of-the-art solvers, RTIP and GAHAP, and simulated and real HapMap data. Due to the computational time and memory reduction caused by our preprocess, problem instances that were previously unaffordable can be now efficiently solved.
  • Keywords
    DNA; bioinformatics; computational complexity; diseases; genetics; inference mechanisms; iterative methods; molecular biophysics; molecular configurations; optimisation; GAHAP; HIPP; HapMap data; NP-hard problem; RTIP; complex diseases; computational time; genotype data; haplotype deletion; haplotype inference; haplotype searching; iterative method; memory reduction; optimization; preprocessing procedure; pure parsimony; Bioinformatics; Computational biology; DNA; Diseases; Inference algorithms; Optimization; Biology and genetics; haplotype inference; optimization.; Algorithms; Computational Biology; HapMap Project; Haplotypes; Humans; Models, Genetic;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2010.125
  • Filename
    5654503