• DocumentCode
    1220
  • Title

    Common Copy Number Variation Detection From Multiple Sequenced Samples

  • Author

    Junbo Duan ; Hong-Wen Deng ; Yu-Ping Wang

  • Author_Institution
    Dept. of Biomed. Eng., Xi´an Jiaotong Univ., Xi´an, China
  • Volume
    61
  • Issue
    3
  • fYear
    2014
  • fDate
    Mar-14
  • Firstpage
    928
  • Lastpage
    937
  • Abstract
    Common copy number variations (CNVs) are small regions of genomic variations at the same loci across multiple samples, which can be detected with high resolution from next-generation sequencing (NGS) technique. Multiple sequencing data samples are often available from genomic studies; examples include sequences from multiple platforms and sequences from multiple individuals. By integrating complementary information from multiple data samples, detection power can be potentially improved. However, most of current CNV detection methods often process an individual sequence sample, or two samples in an abnormal versus matched normal study; researches on detecting common CNVs across multiple samples have been very limited but are much needed. In this paper, we propose a novel method to detect common CNVs from multiple sequencing samples by exploiting the concurrency of genomic variations in read depth signals derived from multiple NGS data. We use a penalized sparse regression model to fit multiple read depth profiles, based on which common CNV identification is formulated as a change-point detection problem. Finally, we validate the proposed method on both simulation and real data, showing that it can give both higher detection power and better break point estimation over several published CNV detection methods.
  • Keywords
    diseases; genomics; patient diagnosis; physiological models; regression analysis; CNV detection methods; break point estimation; complementary information; copy number variation detection; genomic variations; multiple NGS data; multiple data samples; multiple read depth profiles; multiple sequenced samples; multiple sequencing data samples; next-generation sequencing technique; read depth signals; sparse regression model; Bioinformatics; Dispersion; Estimation; Genomics; Matching pursuit algorithms; Sequential analysis; Silicon carbide; $ell$-0 norm penalty; Copy number variation (CNV); Schur complement; model selection; next generation sequencing (NGS); structured sparse modeling; the 1000 genomes project;
  • fLanguage
    English
  • Journal_Title
    Biomedical Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9294
  • Type

    jour

  • DOI
    10.1109/TBME.2013.2292588
  • Filename
    6675802