• DocumentCode
    130375
  • Title

    Change-point detection in binary Markov DNA sequences by the Cross-Entropy method

  • Author

    Polushina, Tatiana ; Sofronov, Georgy

  • Author_Institution
    Dept. of Clinical Sci., Univ. of Bergen, Bergen, Norway
  • fYear
    2014
  • fDate
    7-10 Sept. 2014
  • Firstpage
    471
  • Lastpage
    478
  • Abstract
    A deoxyribonucleic acid (DNA) sequence can be represented as a sequence with 4 characters. If a particular property of the DNA is studied, for example, GC content, then it is possible to consider a binary sequence. In many cases, if the probabilistic properties of a segment differ from the neighbouring ones, this means that the segment can play a structural role. Therefore, DNA segmentation is given a special attention, and it is one of the most significant applications of change-point detection. Problems of this type also arise in a wide variety of areas, for example, seismology, industry (e.g., fault detection), biomedical signal processing, financial mathematics, speech and image processing. In this study, we have developed a Cross-Entropy algorithm for identifying change-points in binary sequences with first-order Markov dependence. We propose a statistical model for this problem and show effectiveness of our algorithm for synthetic and real datasets.
  • Keywords
    DNA; Markov processes; biology computing; entropy; molecular biophysics; molecular configurations; statistical analysis; DNA segmentation; GC content; binary Markov DNA sequences; biomedical signal processing; change-point detection; cross-entropy method; deoxyribonucleic acid sequence; fault detection; financial mathematics; first-order Markov dependence; image processing; probabilistic properties; real datasets; seismology; speech processing; statistical model; synthetic datasets; DNA; Educational institutions; Estimation; Genomics; Markov processes; Optimization; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Information Systems (FedCSIS), 2014 Federated Conference on
  • Conference_Location
    Warsaw
  • Type

    conf

  • DOI
    10.15439/2014F88
  • Filename
    6933053