• DocumentCode
    3272076
  • Title

    Hot topic extraction using time window

  • Author

    Ma, Hui-fang

  • Author_Institution
    Dept. of Comput. Sci., Northwest Normal Univ., Lanzhou Gansu, China
  • Volume
    1
  • fYear
    2011
  • fDate
    10-13 July 2011
  • Firstpage
    56
  • Lastpage
    60
  • Abstract
    This paper presents a novel time window based hot topic extraction model for news stream. The model considers both pervasiveness and burst feature of topic terms. Pervasiveness is evaluated by terms´ occurrences reported from different channels and burst is assessed by terms´ abnormal occurrence frequencies from different time intervals. An energy ratio threshold based approach for burst detection is adopted and time window is introduced for news text stream analysis. TF-PDF is then combined to weigh the terms. Experiment results demonstrate that our model is effective in topic extraction for news texts.
  • Keywords
    information retrieval systems; text analysis; burst detection; burst feature; electronic news document archives; energy ratio threshold based approach; hot topic extraction model; news text stream analysis; pervasiveness feature; term abnormal occurrence frequencies; time window; Cybernetics; Data mining; Educational institutions; Feature extraction; Heuristic algorithms; Machine learning; Machine learning algorithms; Burst Detection; Data Stream; Hot Topic; TF-PDF; Time Window;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics (ICMLC), 2011 International Conference on
  • Conference_Location
    Guilin
  • ISSN
    2160-133X
  • Print_ISBN
    978-1-4577-0305-8
  • Type

    conf

  • DOI
    10.1109/ICMLC.2011.6016664
  • Filename
    6016664