DocumentCode
2319257
Title
Detecting hot topics in technology news streams
Author
You, Bo ; Liu, Ming ; Liu, Bing-quan ; Wang, Xiao-long
Author_Institution
Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin, China
Volume
5
fYear
2012
fDate
15-17 July 2012
Firstpage
1968
Lastpage
1974
Abstract
Detecting hot topics with a fine granularity in technology news streams is an interesting and important problem given the large amount of reports and a relatively narrow range of topics. In this paper, a three-phase method is proposed. In the first phase, the document topic distribution vector is generated and keywords are extracted for each document using topic model pachinko allocation. In the second phase, the documents are clustered based on the document topic distribution vector obtained from the previous phase using affinity propagation. And in the last phase, actual events denoted by combinations of keywords within each cluster are found out using frequent pattern mining algorithms. We evaluate our approach on a collection of technology news reports from various sites in a fixed time period. T he results show that this method is effective.
Keywords
data mining; document handling; information resources; affinity propagation; document topic distribution vector; hot topics detection; pattern mining; technology news streams; topic model pachinko allocation; Abstracts; Merging; Document clustering; Frequent pattern mining; Hot topic; Technology news streams; Topic model;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics (ICMLC), 2012 International Conference on
Conference_Location
Xian
ISSN
2160-133X
Print_ISBN
978-1-4673-1484-8
Type
conf
DOI
10.1109/ICMLC.2012.6359678
Filename
6359678
Link To Document