• DocumentCode
    2099836
  • Title

    Efficient Sequential Pattern Mining Algorithm by Positional Data

  • Author

    Jin, Sha ; Yingxin, Hu ; Lianjuan, Jia

  • Author_Institution
    Dept. of Inf. Sci. & Technol., Shijiazhuang Tiedao Univ., Shijiazhuang, China
  • fYear
    2011
  • fDate
    17-18 Sept. 2011
  • Firstpage
    419
  • Lastpage
    422
  • Abstract
    The CloSpan algorithm first suggested that the closed set of sequential patterns is more compact and has the same expressive power with respect to the full set. Based on the Prefix Span algorithm, CloSpan added two pruning techniques, backward sub-pattern and backward super-pattern, to efficiently mine the closed set. This paper proposed a new closed sequential pattern mining algorithm. However, instead of depth-first searching used in many previous methods, we adopt a breadth-first approach. Besides, previous methods seldom utilize the property of item ordering to enhance efficiency. We used a list of positional data to reserve the information of item ordering. By using these positional data, we developed two main pruning techniques, backward super pattern condition and same positional data condition. To ensure correct and compact resulted lattice, we also manipulated some special conditions. From the experimental results, our algorithm outperforms CloSpan in the cases of moderately large datasets and low support threshold.
  • Keywords
    data mining; pattern recognition; CloSpan algorithm; breadth-first approach; positional data; prefix span algorithm; pruning techniques; sequential pattern mining; Algorithm design and analysis; Data mining; Databases; Educational institutions; Generators; Information science; Lattices; backward super-patter; closed sequential pattern; data mining; sequential pattern;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Internet Computing & Information Services (ICICIS), 2011 International Conference on
  • Conference_Location
    Hong Kong
  • Print_ISBN
    978-1-4577-1561-7
  • Type

    conf

  • DOI
    10.1109/ICICIS.2011.109
  • Filename
    6063286