Title :
Model based clustering approach for identifying structural variation using next generation sequencing data
Author :
Pyon, Yoon Soo ; Hayes, Matthew ; Li, Jing
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Case Western Reserve Univ., Cleveland, OH, USA
Abstract :
Structural variation (SV) has been reported to be associated with numerous diseases such as cancer. With the advent of next generation sequencing (NGS) technologies, various types of SV can be potentially identified. We propose a model based clustering approach utilizing a set of features defined for each type of SV event. Our method, termed SVMiner, not only provides a probability score for each candidate, but also predicts the heterozygosity of genomic deletions. Experiments on genome-wide deep sequencing data have demonstrated that SVMiner is robust against the variability of a single cluster feature, and it performs well when classifying validated SV events with accentuated features.
Keywords :
DNA; biology computing; cancer; data mining; genomics; molecular biophysics; molecular configurations; probability; SVMiner; cancer; diseases; genome-wide deep sequencing data; genomic deletions; heterozygosity; model-based clustering approach; probability score; structural variation; Bioinformatics; Biological cells; Clustering algorithms; Genomics; Humans; Prediction algorithms; Sensitivity; model-based clustering; next generation sequencing; structural variation;
Conference_Titel :
Computational Advances in Bio and Medical Sciences (ICCABS), 2011 IEEE 1st International Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
978-1-61284-851-8
DOI :
10.1109/ICCABS.2011.5729871