Title :
Poster: A Hidden Markov Model for Copy Number Variant prediction from Whole genome resequencing data
Author :
Shen, Yufeng ; Gu, Yiwei ; Pe´er, Itsik
Author_Institution :
Dept. of Comput. Sci., Columbia Univ., New York, NY, USA
Abstract :
Copy number variants (CNVs) are important genetic factors for studying human diseases. While high-throughput whole genome re-sequencing provides multiple lines of evidence for detecting CNVs, computational algorithms need to be tailored for different type or size of CNVs under different experimental designs. To achieve optimal power and resolution of detecting CNVs at low-depth coverage, a Hidden Markov Model that integrates both depth of coverage and mate-pair relationship was implemented. The novelty of the algorithm is the inference of the likelihood of carrying a deletion jointly from multiple mate pairs in a region without the requirement of a single mate pairs being obvious outliers. By integrating all useful information in a comprehensive model, the method is able to detect medium-size deletions (100-2000 bp) at low depth-coverage (<;10× per sample). The method is applied to simulated data and demonstrates the power of detecting medium size deletions is close to theoretical values.
Keywords :
biology computing; genetics; genomics; hidden Markov models; inference mechanisms; molecular biophysics; molecular configurations; copy number variant prediction; coverage; genetic factors; hidden Markov model; human diseases; inference; mate-pair relationship; medium-size deletions; multiple mate pairs; whole genome resequencing data; Algorithm design and analysis; Bioinformatics; Computational biology; Computer science; Genomics; Hidden Markov models; Copy Number Variants; Hidden Markov Model; High-throughput sequencing; Whole Genome Sequencing;
Conference_Titel :
Computational Advances in Bio and Medical Sciences (ICCABS), 2011 IEEE 1st International Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
978-1-61284-851-8
DOI :
10.1109/ICCABS.2011.5729914