• DocumentCode
    1992064
  • Title

    Iterative Refinement of Repeat Sequence Specification Using Constrained Pattern Matching

  • Author

    He, Dan ; Arslan, Abdullah N. ; He, Yu ; Wu, Xindong

  • Author_Institution
    Univ. of Vermont, Burlington
  • fYear
    2007
  • fDate
    14-17 Oct. 2007
  • Firstpage
    1199
  • Lastpage
    1203
  • Abstract
    Repeated sequences in genome are structures which indicate important biological functions such as protein binding. They are associated with various genetic diseases. We consider the problem of finding a specification for a "significant" repeating pattern in a given sequence. A significant pattern carries high amount of information, and it has many non-overlapping repeats. We propose for this problem, a method that takes as input an initial specification for a repeating pattern. A pattern is specified by a sequence of letters separated by varying length wildcards. The method presents to the user maximal occurrences for the current pattern specification in a way that no text symbol can be shared as a letter by two different pattern occurrences. This reduces the begin-end position-overlaps among different occurrences. The user modifies the specification manually to eliminate overlapping repeats. This process continues until a specification for a significant pattern is obtained.
  • Keywords
    biological techniques; cellular biophysics; diseases; genetics; molecular biophysics; proteins; begin-end position-overlaps; constrained pattern matching; current pattern specification; genetic diseases; genome; iterative refinement; nonoverlapping repeats; protein binding; repeat sequence specification; repeating pattern; Bioinformatics; Biology; Computer science; Diseases; Genetics; Genomics; Helium; Humans; Pattern matching; Proteins; edge disjoint path; maximum flow; pattern matching with wildcards; repeats; vertex-disjoints path;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Bioengineering, 2007. BIBE 2007. Proceedings of the 7th IEEE International Conference on
  • Conference_Location
    Boston, MA
  • Print_ISBN
    978-1-4244-1509-0
  • Type

    conf

  • DOI
    10.1109/BIBE.2007.4375715
  • Filename
    4375715