DocumentCode
3130158
Title
Summarizing Contrasts by Recursive Pattern Mining
Author
Soulet, Arnaud ; Crémilleux, Bruno ; Plantevit, Marc
Author_Institution
LI, Univ. of Tours, Blois, France
fYear
2011
fDate
11-11 Dec. 2011
Firstpage
1155
Lastpage
1162
Abstract
A lot of constrained patterns (e.g., emerging patterns, subgroup discovery, classification rules) emphasize the contrasts between data classes and are at the core of many classification techniques. Nevertheless, the extremely large collection of generated patterns hampers the end-user interpretation and the deep understanding of the knowledge revealed by the whole collection of patterns. The key idea of this paper is to summarize the contrasts of a dataset in order to provide understandable characterizations of data classes. We first introduce a novel framework, called recursive pattern mining, for only discovering few as well as relevant patterns. We demonstrate that this approach encompasses usual pattern mining framework and we study its key properties. Then, we use recursive pattern mining for extracting k recursive emerging patterns. Taken together, these patterns form a REP k-summary which summarizes the contrasts of the dataset. Finally, we validate our approach on benchmarks and real-world applications on the biological domain, showing the efficiency and the usefulness of the approach.
Keywords
data mining; pattern classification; REP k-summary; contrast summarization; data classes; end-user interpretation; pattern classification technique; recursive pattern mining; Biology; Conferences; Context; Data mining; Educational institutions; Itemsets; contrasts; pattern mining; summary;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on
Conference_Location
Vancouver, BC
Print_ISBN
978-1-4673-0005-6
Type
conf
DOI
10.1109/ICDMW.2011.161
Filename
6137511
Link To Document