DocumentCode :
2094160
Title :
A Novel Approach to Extract Structured Motifs by Multi-Objective Genetic Algorithm
Author :
Kaya, Mehmet ; Guc, M.
Author_Institution :
Dept. of Comput. Eng., Firat Univ., Elazg
fYear :
2008
fDate :
17-19 June 2008
Firstpage :
278
Lastpage :
283
Abstract :
The functional motifs composed of several sequential blocks are difficult to find. Current mining methods might individually find each motif block but fail to connect them with large irregular gaps. In this paper we propose a novel method for the efficient extraction of structured motifs from DNA sequences using multi-objective genetic algorithm. The main advantage of our approach is that a large number of nondominated motifs can be obtained by a single run with respect to conflicting objectives: similarity and support maximization and gap minimization. To the best of our knowledge, this is the first effort in this direction. The proposed method can be applied to any data set with a sequential character. Furthermore, it allows any choice of similarity measures for finding motifs. By analyzing the obtained optimal motifs, the decision maker can understand the tradeoff between the objectives. We compare our method with the two well-known structured motif extraction methods, EXMOTIF and RISOTTO. Experimental results on synthetics data set demonstrate that the proposed method exhibits good performance over the other methods in terms of runtime.
Keywords :
DNA; biology computing; data mining; feature extraction; genetic algorithms; sequences; DNA sequences; data mining; multiobjective genetic algorithm; structured motifs extraction; Bioinformatics; Biomedical engineering; DNA; Data engineering; Data mining; Genetic algorithms; Genetic engineering; Laboratories; Runtime; Sequences; multi-objective genetic algorithm; structured motif discovery;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer-Based Medical Systems, 2008. CBMS '08. 21st IEEE International Symposium on
Conference_Location :
Jyvaskyla
ISSN :
1063-7125
Print_ISBN :
978-0-7695-3165-6
Type :
conf
DOI :
10.1109/CBMS.2008.99
Filename :
4562001
Link To Document :
بازگشت