• DocumentCode
    191017
  • Title

    Improving bisulfite short-read mapping efficiency with hairpin-bisulfite data

  • Author

    Porter, Joseph ; Ming-an Sun ; Hehuang Xie ; Liqing Zhang

  • Author_Institution
    Comput. Sci., Virginia Tech, Blacksburg, VA, USA
  • fYear
    2014
  • fDate
    2-4 June 2014
  • Firstpage
    1
  • Lastpage
    2
  • Abstract
    DNA methylation is an important epigenetic mark relevant to normal development and disease genesis. A common approach to characterizing genome-wide DNA methylation is to use Next Generation Sequencing technology to sequence bisulfite treated DNA. The short sequence reads are mapped to the reference genome to determine the methylation status of Cs. However, despite intense effort, a much smaller proportion of the reads derived from bisulfite treated DNA (usually about 40-80%) can be mapped than regular short reads mapping (≥ 90%), and it is unclear what factors lead to this low mapping efficiency. To address this issue, we used the hairpin sequencing technology to determine sequences of both DNA double strands simultaneously. This enabled the recovery of the original non-bisulfite-converted sequences. Recovered reads had a higher unique mapping efficiency than the bisulfite reads, and one reason could be that mapping efficiency relates to entropy, and entropy may be lost due to bisulfite treatment. Bismark was used to map bisulfite reads, and Bowtie2 was used to map recovered untreated reads.
  • Keywords
    DNA; diseases; entropy; genetics; genomics; molecular biophysics; DNA double strands; bisulfite short-read mapping efficiency; disease genesis; entropy; epigenetic mark; genome-wide DNA methylation characterization; genomics; hairpin-bisulfite data; next generation sequencing technology; Bioinformatics; DNA; Entropy; Genomics; Mice; Sequential analysis; Software; Bisulfite short read mapping; Methylation; Next-generation Sequencing; Sequence Analysis; hairpin bisulfite data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Advances in Bio and Medical Sciences (ICCABS), 2014 IEEE 4th International Conference on
  • Conference_Location
    Miami, FL
  • Print_ISBN
    978-1-4799-5786-6
  • Type

    conf

  • DOI
    10.1109/ICCABS.2014.6863920
  • Filename
    6863920