• DocumentCode
    3645313
  • Title

    A Constraint Programming Approach for Enumerating Motifs in a Sequence

  • Author

    Emmanuel Coquery;Saïd Jabbour;Lakhdar Saïs

  • Author_Institution
    LIRIS, Univ. Claude Bernard Lyon 1, Villeurbanne, France
  • fYear
    2011
  • Firstpage
    1091
  • Lastpage
    1097
  • Abstract
    In this paper we propose a constraint programming approach for enumerating all frequent patterns with wildcards in a given sequence. To reduce the search space, we show that the anti-monotonicity property of frequent patterns can be dynamically encoded using no good recording based approach. Finally, the constraints network is encoded as a Boolean formula. This last step allows us to exploit the efficiency of modern SAT solvers and particularly their clauses learning component. Preliminary experiments on real world data show the feasibility of our approach in practice.
  • Keywords
    "Programming","Data mining","Databases","Encoding","Solids","Proteins","Electronic mail"
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on
  • Print_ISBN
    978-1-4673-0005-6
  • Type

    conf

  • DOI
    10.1109/ICDMW.2011.10
  • Filename
    6137502