Title :
A fast and practical approach to genotype phasing and imputation on a pedigree with erroneous and incomplete information
Author :
Pirola, Yuri ; Vedova, Gianluca Della ; Biffani, Stefano ; Stella, Alessandra ; Bonizzoni, Paola
Author_Institution :
DISCo, Univ. degli Studi di Milano-Bicocca, Milan, Italy
Abstract :
In this work, we propose the MIN-RECOMBINANT HAPLOTYPE CONFIGURATION WITH BOUNDED ERRORS problem (MRHCE), which extends the original MIN-RECOMBINANT HAPLOTYPE CONFIGURATION formulation by incorporating two common characteristics of real data: errors and missing genotypes (including untyped individuals). We describe a practical algorithm for MRHCE that is based on a reduction to the Satisfiability problem (SAT) and exploits recent advances in the constraint programming literature. An experimental analysis demonstrates the soundness of our model and the effectiveness of the algorithm under several scenarios. The analysis on real data and the comparison with state-of-the-art programs reveals that our approach couples better scalability to large and complex pedigrees with the explicit inclusion of genotyping errors into the model.
Keywords :
biomedical communication; computability; medical computing; medical information systems; physiological models; bounded errors problem; constraint programming literature; genotype phasing; genotyping errors; minrecombinant haplotype configuration formulation; pedigree imputation; practical algorithm; satisfiability problem; state-of-the-art programs; Accuracy; Agriculture; Bioinformatics; Error analysis; Genomics; Vectors; genotyping errors; haplotype inference; missing data; pedigrees; recombinations;
Conference_Titel :
Computational Advances in Bio and Medical Sciences (ICCABS), 2012 IEEE 2nd International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4673-1320-9
Electronic_ISBN :
978-1-4673-1319-3
DOI :
10.1109/ICCABS.2012.6182643