DocumentCode :
3316986
Title :
Approximate Sequential Patterns for Incomplete Sequence Database Mining
Author :
Fiot, Céline ; Laurent, Anne ; Teisseire, Maguelonne
Author_Institution :
Univ. of Montpellier II - CNRS, Montpellier
fYear :
2007
fDate :
23-26 July 2007
Firstpage :
1
Lastpage :
6
Abstract :
Databases available from many industrial or research fields are often imperfect. In particular, they are most of the time incomplete in the sense that some of the values are missing. When facing this kind of imperfect data, two techniques can be investigated: either using only the available information or estimating the missing values. In this paper we propose an estimation-based approach for sequence mining. This approach considers partial inclusion of an item within a record using fuzzy sets. Experiments run on various synthetic datasets show the feasibility and validity of our proposal as well in terms of quality as in terms of the robustness to the rate of missing values.
Keywords :
approximation theory; data mining; database management systems; fuzzy set theory; pattern classification; sequences; sequential estimation; approximate sequential patterns; fuzzy sets; incomplete sequence database mining; missing data estimation; synthetic datasets; Association rules; Data analysis; Data mining; Databases; Fuzzy sets; Information analysis; Mining industry; Pattern analysis; Proposals; Robustness;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems Conference, 2007. FUZZ-IEEE 2007. IEEE International
Conference_Location :
London
ISSN :
1098-7584
Print_ISBN :
1-4244-1209-9
Electronic_ISBN :
1098-7584
Type :
conf
DOI :
10.1109/FUZZY.2007.4295445
Filename :
4295445
Link To Document :
بازگشت