• DocumentCode
    3247111
  • Title

    Design and Implementation of an Algorithm for Finding Frequent Sequential Traversal Patterns from Web Logs Based on Weight Constraint

  • Author

    Sisodia, Mahendra Singh ; Pathak, Mayank ; Verma, Bhupendra ; Nigam, Rajesh K.

  • Author_Institution
    Oriental Inst. of Sci. & Technol., Bhopal, India
  • fYear
    2009
  • fDate
    16-18 Dec. 2009
  • Firstpage
    317
  • Lastpage
    322
  • Abstract
    Many frequent sequential traversal pattern mining algorithms have been developed which mine the set of frequent subsequences traversal pattern satisfying a minimum support constraint in a session database. However, previous frequent sequential traversal pattern mining algorithms give equal weightage to sequential traversal patterns while the pages in sequential traversal patterns have different importance and have different weightage. Another main problem in most of the frequent sequential traversal pattern mining algorithms is that they produce a large number of sequential traversal patterns when a minimum support is lowered and they do not provide alternative ways to adjust the number of sequential traversal patterns other than increasing the minimum support. In this paper, we propose a frequent sequential traversal pattern mining algorithm with weights constraint. Our main approach is to add the weight constraints into the sequential traversal pattern while maintaining the downward closure property. A weight range is defined to maintain the downward closure property and pages are given different weights and traversal sequences assign a minimum and maximum weight. In scanning a session database, a maximum and minimum weight in the session database is used to prune infrequent sequential traversal subsequence by doing downward closure property can be maintained. Our method produces a few but important sequential traversal patterns in session databases with a low minimum support, by adjusting a weight range of pages and sequence.
  • Keywords
    Internet; data mining; database management systems; information retrieval; Web logs; Web usage mining; downward closure property maintenance; frequent sequential traversal pattern mining algorithms; session database; weight constraint; Algorithm design and analysis; Computer science; Data engineering; Data mining; Databases; Design engineering; Web mining; Web pages; Web server; Web sites;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Emerging Trends in Engineering and Technology (ICETET), 2009 2nd International Conference on
  • Conference_Location
    Nagpur
  • Print_ISBN
    978-1-4244-5250-7
  • Electronic_ISBN
    978-0-7695-3884-6
  • Type

    conf

  • DOI
    10.1109/ICETET.2009.70
  • Filename
    5395407