• DocumentCode
    3602521
  • Title

    DPNuc: Identifying Nucleosome Positions Based on the Dirichlet Process Mixture Model

  • Author

    Huidong Chen ; Jihong Guan ; Shuigeng Zhou

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Tongji Univ., Shanghai, China
  • Volume
    12
  • Issue
    6
  • fYear
    2015
  • Firstpage
    1236
  • Lastpage
    1247
  • Abstract
    Nucleosomes and the free linker DNA between them assemble the chromatin. Nucleosome positioning plays an important role in gene transcription regulation, DNA replication and repair, alternative splicing, and so on. With the rapid development of ChIP-seq, it is possible to computationally detect the positions of nucleosomes on chromosomes. However, existing methods cannot provide accurate and detailed information about the detected nucleosomes, especially for the nucleosomes with complex configurations where overlaps and noise exist. Meanwhile, they usually require some prior knowledge of nucleosomes as input, such as the size or the number of the unknown nucleosomes, which may significantly influence the detection results. In this paper, we propose a novel approach DPNuc for identifying nucleosome positions based on the Dirichlet process mixture model. In our method, Markov chain Monte Carlo (MCMC) simulations are employed to determine the mixture model with no need of prior knowledge about nucleosomes. Compared with three existing methods, our approach can provide more detailed information of the detected nucleosomes and can more reasonably reveal the real configurations of the chromosomes; especially, our approach performs better in the complex overlapping situations. By mapping the detected nucleosomes to a synthetic benchmark nucleosome map and two existing benchmark nucleosome maps, it is shown that our approach achieves a better performance in identifying nucleosome positions and gets a higher F-score. Finally, we show that our approach can more reliably detect the size distribution of nucleosomes.
  • Keywords
    DNA; Markov processes; Monte Carlo methods; biological techniques; cellular biophysics; molecular biophysics; molecular configurations; ChIP-seq; DNA repair; DNA replication; DPNuc; Dirichlet process mixture model; Markov chain Monte Carlo simulation; chromatin; chromosome; gene transcription regulation; nucleosome position; nucleosome size distribution; synthetic benchmark nucleosome map; Bioinformatics; Biological cells; Computational modeling; DNA; Genomics; Noise measurement;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2015.2430350
  • Filename
    7112520