DocumentCode :
1966376
Title :
MotifHider: A knowledge hiding approach to sequence masking
Author :
Abul, Osman
Author_Institution :
Dept. of Comput. Eng., TOBB Univ. of Econ. & Technol., Ankara, Turkey
fYear :
2009
fDate :
14-16 Sept. 2009
Firstpage :
171
Lastpage :
176
Abstract :
In a typical de novo motif discovery process, it is quite common that many of the motif candidates output from motif discovery programs are either already known motifs or motif-like decoy/repeat patterns. To prevent the false discovery and also to increase the chance of authentic novel motif discovery, some motif discovery programs employ a pre-processing stage in order to mask certain repeat positions in the input sequences. There are a few approaches to sequence masking aimed at avoiding the false discovery. This paper introduces a novel approach and a tool, called MotifHider, to sequence masking problem. MotifHider exploits sensitive knowledge hiding principles from database sharing. By hiding certain patterns, it provides successive motif discovery programs to avoid false discovery and rediscovery. At the same time, it avoids overly distortion of the input sequences so as to retain most of the authentic motifs.
Keywords :
data encapsulation; data mining; MotifHider; database sharing; de novo motif discovery process; knowledge hiding approach; sequence masking problem; Bioinformatics; Biological information theory; Biology computing; DNA; Databases; Evolution (biology); Genomics; Knowledge engineering; Libraries; Sequences;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer and Information Sciences, 2009. ISCIS 2009. 24th International Symposium on
Conference_Location :
Guzelyurt
Print_ISBN :
978-1-4244-5021-3
Electronic_ISBN :
978-1-4244-5023-7
Type :
conf
DOI :
10.1109/ISCIS.2009.5291843
Filename :
5291843
Link To Document :
بازگشت