• DocumentCode
    2382955
  • Title

    Hot keyword identification for extracting web public opinion

  • Author

    Fang, Zhiqi ; Ning, Yue ; Zhu, Tingshao

  • Author_Institution
    Nat. Comput. Syst. Eng. Res. Inst. of China, Beijing, China
  • fYear
    2010
  • fDate
    1-3 Dec. 2010
  • Firstpage
    116
  • Lastpage
    121
  • Abstract
    Internet is becoming an increasingly important platform for ordinary life and work. It is expected that keyword extraction can help people quickly find hot spots on the web, since keywords in a document provide important information about the content of the document. In this paper, we propose to use text clustering method based on semi-supervised learning to get focuses of social topics in a large amount of text. We develop a novel keyword extraction method named NATF-PDF, which is based on TFPDF algorithm, combined with supervised learning theory for keyword extraction. We compare its performance with TFIDF in comparison, and the results show that our method get better accuracy and recall ratio.
  • Keywords
    Internet; information retrieval; learning (artificial intelligence); text analysis; Internet; NATF-PDF; TFPDF algorithm; Web public opinion extraction; hot keyword identification; keyword extraction method; semi-supervised learning; supervised learning theory; text clustering method; Clustering; Keyword Extraction; NATF-PDF; Semi-supervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pervasive Computing and Applications (ICPCA), 2010 5th International Conference on
  • Conference_Location
    Maribor
  • Print_ISBN
    978-1-4244-9144-5
  • Type

    conf

  • DOI
    10.1109/ICPCA.2010.5704085
  • Filename
    5704085