• DocumentCode
    3130158
  • Title

    Summarizing Contrasts by Recursive Pattern Mining

  • Author

    Soulet, Arnaud ; Crémilleux, Bruno ; Plantevit, Marc

  • Author_Institution
    LI, Univ. of Tours, Blois, France
  • fYear
    2011
  • fDate
    11-11 Dec. 2011
  • Firstpage
    1155
  • Lastpage
    1162
  • Abstract
    A lot of constrained patterns (e.g., emerging patterns, subgroup discovery, classification rules) emphasize the contrasts between data classes and are at the core of many classification techniques. Nevertheless, the extremely large collection of generated patterns hampers the end-user interpretation and the deep understanding of the knowledge revealed by the whole collection of patterns. The key idea of this paper is to summarize the contrasts of a dataset in order to provide understandable characterizations of data classes. We first introduce a novel framework, called recursive pattern mining, for only discovering few as well as relevant patterns. We demonstrate that this approach encompasses usual pattern mining framework and we study its key properties. Then, we use recursive pattern mining for extracting k recursive emerging patterns. Taken together, these patterns form a REP k-summary which summarizes the contrasts of the dataset. Finally, we validate our approach on benchmarks and real-world applications on the biological domain, showing the efficiency and the usefulness of the approach.
  • Keywords
    data mining; pattern classification; REP k-summary; contrast summarization; data classes; end-user interpretation; pattern classification technique; recursive pattern mining; Biology; Conferences; Context; Data mining; Educational institutions; Itemsets; contrasts; pattern mining; summary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on
  • Conference_Location
    Vancouver, BC
  • Print_ISBN
    978-1-4673-0005-6
  • Type

    conf

  • DOI
    10.1109/ICDMW.2011.161
  • Filename
    6137511