• DocumentCode
    2860073
  • Title

    Automatic Pattern-Taxonomy Extraction for Web Mining

  • Author

    Wu, Sheng-Tang ; Li, Yuefeng ; Xu, Yue ; Pham, Binh ; Chen, Phoebe

  • Author_Institution
    Queensland University of Technology, Australia
  • fYear
    2004
  • fDate
    20-24 Sept. 2004
  • Firstpage
    242
  • Lastpage
    248
  • Abstract
    In this paper, we propose a model for discovering frequent sequential patterns, phrases, which can be used as profile descriptors of documents. It is indubitable that we can obtain numerous phrases using data mining algorithms. However, it is difficult to use these phrases effectively for answering what users want. Therefore, we present a pattern taxonomy extraction model which performs the task of extracting descriptive frequent sequential patterns by pruning the meaningless ones. The model then is extended and tested by applying it to the information filtering system. The results of the experiment show that pattern-based methods outperform the keyword-based methods. The results also indicate that removal of meaningless patterns not only reduces the cost of computation but also improves the effectiveness of the system.
  • Keywords
    Association rules; Data communication; Data mining; Data processing; Information filtering; Information filters; Software engineering; Taxonomy; Text mining; Web mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence, 2004. WI 2004. Proceedings. IEEE/WIC/ACM International Conference on
  • Print_ISBN
    0-7695-2100-2
  • Type

    conf

  • DOI
    10.1109/WI.2004.10132
  • Filename
    1410810