• DocumentCode
    2371003
  • Title

    Combining the Web content and usage mining to understand the visitor behavior in a Web site

  • Author

    Velásquez, Juan ; Yasuda, Hiroshi ; Aoki, Terumasa

  • Author_Institution
    Res. Center for Adv. Sci. & Technol., Tokyo Univ., Japan
  • fYear
    2003
  • fDate
    19-22 Nov. 2003
  • Firstpage
    669
  • Lastpage
    672
  • Abstract
    A Web site is a semi structured collection of different kinds of data, whose motivation is to show relevant information to a visitor and in this way capture her/his attention. Understanding the specific preferences that define the visitor behavior in a Web site is a complex task. An approximation is supposed that depends on the content, navigation sequence and time spent in each page visited. These variables can be extracted from the Web log files and the Web site itself, using Web usage and content mining respectively. Combining the described variables, a similarity measure among visitor sessions is introduced and used in a clustering algorithm, which identifies groups of similar sessions, allowing the analysis of visitor behavior. In order to prove the methodology´s effectiveness, it was applied in a certain Web site, showing the benefits of the described approach.
  • Keywords
    Web sites; content management; data mining; information retrieval; self-organising feature maps; Web content; Web log file; Web site; Web usage; clustering algorithm; content mining; navigation sequence; visitor behavior analysis; Algorithm design and analysis; Clustering algorithms; Data mining; Electronic mail; Internet; Navigation; Time measurement; Web mining; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
  • Print_ISBN
    0-7695-1978-4
  • Type

    conf

  • DOI
    10.1109/ICDM.2003.1251004
  • Filename
    1251004