DocumentCode
2860073
Title
Automatic Pattern-Taxonomy Extraction for Web Mining
Author
Wu, Sheng-Tang ; Li, Yuefeng ; Xu, Yue ; Pham, Binh ; Chen, Phoebe
Author_Institution
Queensland University of Technology, Australia
fYear
2004
fDate
20-24 Sept. 2004
Firstpage
242
Lastpage
248
Abstract
In this paper, we propose a model for discovering frequent sequential patterns, phrases, which can be used as profile descriptors of documents. It is indubitable that we can obtain numerous phrases using data mining algorithms. However, it is difficult to use these phrases effectively for answering what users want. Therefore, we present a pattern taxonomy extraction model which performs the task of extracting descriptive frequent sequential patterns by pruning the meaningless ones. The model then is extended and tested by applying it to the information filtering system. The results of the experiment show that pattern-based methods outperform the keyword-based methods. The results also indicate that removal of meaningless patterns not only reduces the cost of computation but also improves the effectiveness of the system.
Keywords
Association rules; Data communication; Data mining; Data processing; Information filtering; Information filters; Software engineering; Taxonomy; Text mining; Web mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence, 2004. WI 2004. Proceedings. IEEE/WIC/ACM International Conference on
Print_ISBN
0-7695-2100-2
Type
conf
DOI
10.1109/WI.2004.10132
Filename
1410810
Link To Document