• DocumentCode
    1966376
  • Title

    MotifHider: A knowledge hiding approach to sequence masking

  • Author

    Abul, Osman

  • Author_Institution
    Dept. of Comput. Eng., TOBB Univ. of Econ. & Technol., Ankara, Turkey
  • fYear
    2009
  • fDate
    14-16 Sept. 2009
  • Firstpage
    171
  • Lastpage
    176
  • Abstract
    In a typical de novo motif discovery process, it is quite common that many of the motif candidates output from motif discovery programs are either already known motifs or motif-like decoy/repeat patterns. To prevent the false discovery and also to increase the chance of authentic novel motif discovery, some motif discovery programs employ a pre-processing stage in order to mask certain repeat positions in the input sequences. There are a few approaches to sequence masking aimed at avoiding the false discovery. This paper introduces a novel approach and a tool, called MotifHider, to sequence masking problem. MotifHider exploits sensitive knowledge hiding principles from database sharing. By hiding certain patterns, it provides successive motif discovery programs to avoid false discovery and rediscovery. At the same time, it avoids overly distortion of the input sequences so as to retain most of the authentic motifs.
  • Keywords
    data encapsulation; data mining; MotifHider; database sharing; de novo motif discovery process; knowledge hiding approach; sequence masking problem; Bioinformatics; Biological information theory; Biology computing; DNA; Databases; Evolution (biology); Genomics; Knowledge engineering; Libraries; Sequences;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and Information Sciences, 2009. ISCIS 2009. 24th International Symposium on
  • Conference_Location
    Guzelyurt
  • Print_ISBN
    978-1-4244-5021-3
  • Electronic_ISBN
    978-1-4244-5023-7
  • Type

    conf

  • DOI
    10.1109/ISCIS.2009.5291843
  • Filename
    5291843