DocumentCode
244902
Title
Sequence Classification Based on Delta-Free Sequential Patterns
Author
Holat, Pierre ; Plantevit, Marc ; Raissi, Chedy ; Tomeh, Nadi ; Charnois, Thierry ; Cremilleux, Bruno
Author_Institution
Univ. Paris 13, Paris, France
fYear
2014
fDate
14-17 Dec. 2014
Firstpage
170
Lastpage
179
Abstract
Sequential pattern mining is one of the most studied and challenging tasks in data mining. However, the extension of well-known methods from many other classical patterns to sequences is not a trivial task. In this paper we study the notion of δ-freeness for sequences. While this notion has extensively been discussed for itemsets, this work is the first to extend it to sequences. We define an efficient algorithm devoted to the extraction of δ-free sequential patterns. Furthermore, we show the advantage of the δ-free sequences and highlight their importance when building sequence classifiers, and we show how they can be used to address the feature selection problem in statistical classifiers, as well as to build symbolic classifiers which optimizes both accuracy and earliness of predictions.
Keywords
data mining; feature selection; pattern classification; statistical analysis; δ-free sequential patterns; δ-freeness; data mining; delta-free sequential patterns; feature selection problem; itemsets; sequence classification; sequential pattern mining; statistical classifiers; symbolic classifiers; Accuracy; Data mining; Feature extraction; Generators; Itemsets; Runtime; early classification; feature selection; free patterns; sequence mining; text classification;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining (ICDM), 2014 IEEE International Conference on
Conference_Location
Shenzhen
ISSN
1550-4786
Print_ISBN
978-1-4799-4303-6
Type
conf
DOI
10.1109/ICDM.2014.154
Filename
7023334
Link To Document