Title :
A Constraint Programming Approach for Enumerating Motifs in a Sequence
Author :
Emmanuel Coquery;Saïd Jabbour;Lakhdar Saïs
Author_Institution :
LIRIS, Univ. Claude Bernard Lyon 1, Villeurbanne, France
Abstract :
In this paper, we propose a constraint programming approach for enumerating all frequent patterns with wildcards in a given sequence. To reduce the search space, we show that the anti-monotonicity property of frequent patterns can be encoded dynamically using a nogood-recording approach. Finally, the constraint network is encoded as a Boolean formula. This last step allows us to exploit the efficiency of modern SAT solvers, in particular their clause-learning component. Preliminary experiments on real-world data show the feasibility of our approach in practice.
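To make the mining task concrete, here is a minimal Python sketch of the problem the abstract describes. It is not the paper's constraint or SAT encoding, only a naive depth-first enumeration. It rests on assumptions the abstract does not state: a pattern's support is taken to be the number of start positions at which it matches, a wildcard matches any single character, reported patterns start and end with a non-wildcard character, and the sequence does not itself contain the wildcard symbol. The names `WILDCARD`, `occurrences`, and `frequent_patterns`, and the `max_length` bound, are hypothetical. The pruning step illustrates anti-monotonicity: extending a pattern can never increase its support, so an infrequent prefix is abandoned.

```python
WILDCARD = "."  # assumed wildcard symbol; the abstract does not name one


def occurrences(pattern, sequence):
    """Return the start positions at which pattern matches sequence."""
    width = len(pattern)
    return [
        i
        for i in range(len(sequence) - width + 1)
        if all(p == WILDCARD or p == sequence[i + j] for j, p in enumerate(pattern))
    ]


def frequent_patterns(sequence, min_support, max_length):
    """Map each frequent pattern of at most max_length symbols to its support."""
    alphabet = sorted(set(sequence))
    found = {}

    def extend(prefix):
        support = len(occurrences(prefix, sequence))
        if support < min_support:
            # Anti-monotonicity: no extension of an infrequent prefix is frequent.
            return
        if prefix[-1] != WILDCARD:
            # Only report patterns that end with a non-wildcard character.
            found[prefix] = support
        if len(prefix) < max_length:
            for symbol in alphabet + [WILDCARD]:
                extend(prefix + symbol)

    # Patterns must also start with a non-wildcard character.
    for symbol in alphabet:
        extend(symbol)
    return found


if __name__ == "__main__":
    for pattern, support in sorted(frequent_patterns("abcabcab", 2, 3).items()):
        print(pattern, support)
```

For the toy sequence `abcabcab` with a minimum support of 2, the pattern `a.c` is reported because it matches at two positions, while `ac` is pruned immediately because it never occurs.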
Keywords :
"Programming","Data mining","Databases","Encoding","Solids","Proteins","Electronic mail"
Conference_Title :
Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on
Print_ISBN :
978-1-4673-0005-6
DOI :
10.1109/ICDMW.2011.10