• DocumentCode
    244902
  • Title

    Sequence Classification Based on Delta-Free Sequential Patterns

  • Author

    Holat, Pierre ; Plantevit, Marc ; Raissi, Chedy ; Tomeh, Nadi ; Charnois, Thierry ; Cremilleux, Bruno

  • Author_Institution
    Univ. Paris 13, Paris, France
  • fYear
    2014
  • fDate
    14-17 Dec. 2014
  • Firstpage
    170
  • Lastpage
    179
  • Abstract
    Sequential pattern mining is one of the most studied and challenging tasks in data mining. However, the extension of well-known methods from many other classical patterns to sequences is not a trivial task. In this paper we study the notion of δ-freeness for sequences. While this notion has extensively been discussed for itemsets, this work is the first to extend it to sequences. We define an efficient algorithm devoted to the extraction of δ-free sequential patterns. Furthermore, we show the advantage of the δ-free sequences and highlight their importance when building sequence classifiers, and we show how they can be used to address the feature selection problem in statistical classifiers, as well as to build symbolic classifiers which optimizes both accuracy and earliness of predictions.
  • Keywords
    data mining; feature selection; pattern classification; statistical analysis; δ-free sequential patterns; δ-freeness; data mining; delta-free sequential patterns; feature selection problem; itemsets; sequence classification; sequential pattern mining; statistical classifiers; symbolic classifiers; Accuracy; Data mining; Feature extraction; Generators; Itemsets; Runtime; early classification; feature selection; free patterns; sequence mining; text classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2014 IEEE International Conference on
  • Conference_Location
    Shenzhen
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4799-4303-6
  • Type

    conf

  • DOI
    10.1109/ICDM.2014.154
  • Filename
    7023334