• DocumentCode
    2922356
  • Title

    pSPADE : Mining sequential pattern using personalized support threshold value

  • Author

    Alias, Suraya ; Norwawi, Norita Md

  • Author_Institution
    College of Arts and Sciences, Universiti Utara Malaysia, Malaysia
  • Volume
    2
  • fYear
    2008
  • fDate
    26-28 Aug. 2008
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    As the web log data is considered as complex and temporal, applying Sequential Pattern Mining technique becomes a challenging task. The min sup threshold issue is highlighted - as a pattern is considered as frequent if it meets the specified min sup. If the min sup is high, few patterns are discovered else the mining process will be longer if too many patterns generated using low min sup. The format of web log data that creates consecutive occurring pages has made it difficult to generate frequent sequences. Also, as each user’ behaviour is unique; using one min sup value for all users may affect the pattern generation. This research introduced a personalized minimum support threshold for each web users using their Median item access (support) value to curb this problem. The pSPADE performance was the highest on the discovery of user’s origin and also interesting pattern discovery attribute.
  • Keywords
    Algorithm design and analysis; Art; Databases; Delay; Educational institutions; Navigation; Pattern analysis; Web page design; Web server; World Wide Web;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Technology, 2008. ITSim 2008. International Symposium on
  • Conference_Location
    Kuala Lumpur, Malaysia
  • Print_ISBN
    978-1-4244-2327-9
  • Electronic_ISBN
    978-1-4244-2328-6
  • Type

    conf

  • DOI
    10.1109/ITSIM.2008.4631672
  • Filename
    4631672