DocumentCode :
3645313
Title :
A Constraint Programming Approach for Enumerating Motifs in a Sequence
Author :
Emmanuel Coquery;Saïd Jabbour;Lakhdar Saïs
Author_Institution :
LIRIS, Univ. Claude Bernard Lyon 1, Villeurbanne, France
fYear :
2011
Firstpage :
1091
Lastpage :
1097
Abstract :
In this paper, we propose a constraint programming approach for enumerating all frequent patterns with wildcards in a given sequence. To reduce the search space, we show that the anti-monotonicity property of frequent patterns can be encoded dynamically using a nogood-recording approach. Finally, the constraint network is encoded as a Boolean formula. This last step allows us to exploit the efficiency of modern SAT solvers, particularly their clause-learning component. Preliminary experiments on real-world data show that the approach is feasible in practice.
Keywords :
"Programming","Data mining","Databases","Encoding","Solids","Proteins","Electronic mail"
Publisher :
IEEE
Conference_Title :
Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on
Print_ISBN :
978-1-4673-0005-6
Type :
conf
DOI :
10.1109/ICDMW.2011.10
Filename :
6137502