DocumentCode
2736722
Title
Model based clustering approach for identifying structural variation using next generation sequencing data
Author
Pyon, Yoon Soo ; Hayes, Matthew ; Li, Jing
Author_Institution
Dept. of Electr. Eng. & Comput. Sci., Case Western Reserve Univ., Cleveland, OH, USA
fYear
2011
fDate
3-5 Feb. 2011
Firstpage
153
Lastpage
158
Abstract
Structural variation (SV) has been reported to be associated with numerous diseases such as cancer. With the advent of next generation sequencing (NGS) technologies, various types of SV can be potentially identified. We propose a model based clustering approach utilizing a set of features defined for each type of SV event. Our method, termed SVMiner, not only provides a probability score for each candidate, but also predicts the heterozygosity of genomic deletions. Experiments on genome-wide deep sequencing data have demonstrated that SVMiner is robust against the variability of a single cluster feature, and it performs well when classifying validated SV events with accentuated features.
Keywords
DNA; biology computing; cancer; data mining; genomics; molecular biophysics; molecular configurations; probability; SVMiner; cancer; diseases; genome-wide deep sequencing data; genomic deletions; heterozygosity; model-based clustering approach; probability score; structural variation; Bioinformatics; Biological cells; Clustering algorithms; Genomics; Humans; Prediction algorithms; Sensitivity; model-based clustering; next generation sequencing; structural variation;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Advances in Bio and Medical Sciences (ICCABS), 2011 IEEE 1st International Conference on
Conference_Location
Orlando, FL
Print_ISBN
978-1-61284-851-8
Type
conf
DOI
10.1109/ICCABS.2011.5729871
Filename
5729871
Link To Document