DocumentCode
2734131
Title
Deploying Top-k Specific Patterns for Relevance Feature Discovery
Author
Pipanmaekaporn, Luepol ; Li, Yuefeng ; Geva, Shlomo
Author_Institution
Fac. of Sci. & Technol., Queensland Univ. of Technol., Brisbane, QLD, Australia
Volume
3
fYear
2010
fDate
Aug. 31 2010-Sept. 3 2010
Firstpage
318
Lastpage
321
Abstract
The quality of discovered patterns is important for relevance feature discovery in text documents because frequent pattern mining often produces many noisy patterns. In this paper, we propose a novel method for the summarization of discovered patterns in text documents, finding a smaller number of specific patterns for representing the large number of discovered patterns for a given topic. We also evaluate the proposed method by implementing a new pattern-based information filtering model. The experimental results show that the proposed method not only outperforms both the term-based approaches and pattern based approaches, but largely reduces the number of feature terms as well.
Keywords
data mining; information filtering; text analysis; discovered pattern summarization; frequent pattern mining; pattern based approaches; pattern-based information filtering model; relevance feature discovery; term-based approaches; text documents; top-k specific pattern quality; Conferences; Feature extraction; Frequency measurement; Ontologies; Taxonomy; Text mining; Training; Information Filtering; Pattern Mining; Relevance Feature Discovery; Text Mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2010 IEEE/WIC/ACM International Conference on
Conference_Location
Toronto, ON
Print_ISBN
978-1-4244-8482-9
Electronic_ISBN
978-0-7695-4191-4
Type
conf
DOI
10.1109/WI-IAT.2010.194
Filename
5614182
Link To Document