• DocumentCode
    2319257
  • Title

    Detecting hot topics in technology news streams

  • Author

    You, Bo ; Liu, Ming ; Liu, Bing-quan ; Wang, Xiao-long

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin, China
  • Volume
    5
  • fYear
    2012
  • fDate
    15-17 July 2012
  • Firstpage
    1968
  • Lastpage
    1974
  • Abstract
    Detecting hot topics with a fine granularity in technology news streams is an interesting and important problem given the large amount of reports and a relatively narrow range of topics. In this paper, a three-phase method is proposed. In the first phase, the document topic distribution vector is generated and keywords are extracted for each document using topic model pachinko allocation. In the second phase, the documents are clustered based on the document topic distribution vector obtained from the previous phase using affinity propagation. And in the last phase, actual events denoted by combinations of keywords within each cluster are found out using frequent pattern mining algorithms. We evaluate our approach on a collection of technology news reports from various sites in a fixed time period. T he results show that this method is effective.
  • Keywords
    data mining; document handling; information resources; affinity propagation; document topic distribution vector; hot topics detection; pattern mining; technology news streams; topic model pachinko allocation; Abstracts; Merging; Document clustering; Frequent pattern mining; Hot topic; Technology news streams; Topic model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics (ICMLC), 2012 International Conference on
  • Conference_Location
    Xian
  • ISSN
    2160-133X
  • Print_ISBN
    978-1-4673-1484-8
  • Type

    conf

  • DOI
    10.1109/ICMLC.2012.6359678
  • Filename
    6359678