DocumentCode
272
Title
k-Pattern Set Mining under Constraints
Author
Guns, Tias ; Nijssen, Siegfried ; De Raedt, Luc
Author_Institution
Dept. of Comput. Sci., K.U. Leuven, Leuven, Belgium
Volume
25
Issue
2
fYear
2013
fDate
Feb. 2013
Firstpage
402
Lastpage
418
Abstract
We introduce the problem of k-pattern set mining, concerned with finding a set of k related patterns under constraints. This contrasts to regular pattern mining, where one searches for many individual patterns. The k-pattern set mining problem is a very general problem that can be instantiated to a wide variety of well-known mining tasks including concept-learning, rule-learning, redescription mining, conceptual clustering and tiling. To this end, we formulate a large number of constraints for use in k-pattern set mining, both at the local level, that is, on individual patterns, and on the global level, that is, on the overall pattern set. Building general solvers for the pattern set mining problem remains a challenge. Here, we investigate to what extent constraint programming (CP) can be used as a general solution strategy. We present a mapping of pattern set constraints to constraints currently available in CP. This allows us to investigate a large number of settings within a unified framework and to gain insight in the possibilities and limitations of these solvers. This is important as it allows us to create guidelines in how to model new problems successfully and how to model existing problems more efficiently. It also opens up the way for other solver technologies.
Keywords
constraint handling; data mining; learning (artificial intelligence); pattern clustering; CP; concept-learning; conceptual clustering; constraint programming; global level; k-pattern set mining; local level; pattern set constraint mapping; redescription mining; regular pattern mining; rule-learning; tiling; Accuracy; Data mining; Itemsets; Optimization; Size measurement; Data mining; constraint programming; constraints; pattern set mining;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2011.204
Filename
6035705
Link To Document