• DocumentCode
    3079275
  • Title

    A two stage approach for Contiguous Sequential Pattern mining

  • Author

    Chen, Jinlin ; Shankar, Subash ; Kelly, Angela ; Gningue, Serigne ; Rajaravivarma, Rathika

  • fYear
    2009
  • fDate
    10-12 Aug. 2009
  • Firstpage
    382
  • Lastpage
    387
  • Abstract
    Contiguous Sequential Pattern (CSP) mining is an important problem with many applications. Using general sequential pattern mining algorithms for CSP mining may lead to poor performance due to the lack of consideration on the contiguous property of CSP. In this paper we present a two stage approach for CSP mining. We first detect frequent itemsets in a database, based on which we partition the CSPs into subsets and apply a special data structure, General UpDown Tree, to detect all the patterns in each subset. The General Updown Tree exploits the contiguous property of CSPs to achieve a compact representation of all the sequences that contain an item. Such compact representation enables us to apply a top down approach for CSP mining and eliminates unnecessary candidate evaluation. Experiment results show that our approach is more efficient compared to previous approaches in terms of both time and space.
  • Keywords
    data mining; tree data structures; contiguous property; contiguous sequential pattern mining; data structure; top down approach; updown tree; Cities and towns; Data mining; Educational institutions; Itemsets; Partitioning algorithms; Spatial databases; Testing; Tree data structures; Contiguous sequential pattern; Data mining algorithm; Sequence database; Sequential pattern;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Reuse & Integration, 2009. IRI '09. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • Print_ISBN
    978-1-4244-4114-3
  • Electronic_ISBN
    978-1-4244-4116-7
  • Type

    conf

  • DOI
    10.1109/IRI.2009.5211583
  • Filename
    5211583