DocumentCode
3645313
Title
A Constraint Programming Approach for Enumerating Motifs in a Sequence
Author
Emmanuel Coquery;Saïd Jabbour;Lakhdar Saïs
Author_Institution
LIRIS, Univ. Claude Bernard Lyon 1, Villeurbanne, France
fYear
2011
Firstpage
1091
Lastpage
1097
Abstract
In this paper, we propose a constraint programming approach for enumerating all frequent patterns with wildcards in a given sequence. To reduce the search space, we show that the anti-monotonicity property of frequent patterns can be encoded dynamically using a nogood-recording approach. Finally, the constraint network is encoded as a Boolean formula. This last step allows us to exploit the efficiency of modern SAT solvers, particularly their clause-learning component. Preliminary experiments on real-world data show the feasibility of our approach in practice.
Keywords
"Programming","Data mining","Databases","Encoding","Solids","Proteins","Electronic mail"
Publisher
IEEE
Conference_Titel
Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on
Print_ISBN
978-1-4673-0005-6
Type
conf
DOI
10.1109/ICDMW.2011.10
Filename
6137502