• DocumentCode
    3563623
  • Title

    Discovery of various sequential patterns within top-k from sequential data

  • Author

    Sakurai, Shigeaki ; Nishizawa, Minoru

  • Author_Institution
    Big Data Cloud Technol. Center, Toshiba Corp. Cloud & Solutions Co., Kanagawa, Japan
  • fYear
    2014
  • Firstpage
    446
  • Lastpage
    451
  • Abstract
    This paper proposes a method that discovers various sequential patterns from sequential data. The sequential data is a set of sequences. Each sequence is a row of item sets. Many previous methods discover frequent sequential patterns from the data. However, the patterns tend to be similar to each other because they are composed of limited items. The patterns do not always correspond to the interests of analysts. Therefore, this paper tackles on the issue discovering various sequential patterns. The proposed method discovers them by evaluating the variety of items and deleting redundant patterns based on three kinds of delete processes. It can discover various patterns within the upper bound for the number of sequential patterns given by the analysts. This paper applies the method to the synthetic sequential data which is characterized by number of items, their kind, and length of sequence. The effect of the method is verified through numerical experiments.
  • Keywords
    data handling; pattern recognition; frequent sequential patterns discovery; item sets; synthetic sequential data; top-k; Bars; Big data; Companies; Dairy products; Noise; Time measurement; Upper bound; Delete of redundant patterns; Sequential Pattern; Upper pattern constraint; Variety of items;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Soft Computing and Intelligent Systems (SCIS), 2014 Joint 7th International Conference on and Advanced Intelligent Systems (ISIS), 15th International Symposium on
  • Type

    conf

  • DOI
    10.1109/SCIS-ISIS.2014.7044644
  • Filename
    7044644