Title of article :
Fast discovery of sequential patterns in large databases using effective time-indexing
Author/Authors :
Ming-Yen Lin، نويسنده , , Sue-Chen Hsueh، نويسنده , , Chia-Wen Chang، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2008
Pages :
18
From page :
4228
To page :
4245
Abstract :
Sequential pattern mining algorithms can often produce more accurate results if they work with specific constraints in addition to the support threshold. Many systems implement time-independent constraints by selecting qualified patterns. This selection cannot implement time-dependent constraints, because the support computation process must validate the time attributes of every data sequence during mining. Therefore, we propose a memory time-indexing approach, called METISP, to discover sequential patterns with time constraints including minimum-gap, maximum-gap, exact-gap, sliding window, and duration constraints. METISP scans the database into memory and constructs time-index sets for effective processing. METISP uses index sets and a pattern-growth strategy to mine patterns without generating any candidates or sub-databases. The index sets narrow down the search space to the sets of designated in-memory data sequences, and speed up the counting of potential items within the indicated ranges. Our comprehensive experiments show that METISP has better efficiency, even with low support and large databases, than the well-known GSP and DELISP algorithms. METISP scales up linearly with respect to database size.
Keywords :
Time constraint , Time-index , Sequence mining , Sequential patterns , Pattern-growth
Journal title :
Information Sciences
Serial Year :
2008
Journal title :
Information Sciences
Record number :
1213450
Link To Document :
بازگشت