Title :
Hot keyword identification for extracting web public opinion
Author :
Fang, Zhiqi ; Ning, Yue ; Zhu, Tingshao
Author_Institution :
Nat. Comput. Syst. Eng. Res. Inst. of China, Beijing, China
Abstract :
Internet is becoming an increasingly important platform for ordinary life and work. It is expected that keyword extraction can help people quickly find hot spots on the web, since keywords in a document provide important information about the content of the document. In this paper, we propose to use text clustering method based on semi-supervised learning to get focuses of social topics in a large amount of text. We develop a novel keyword extraction method named NATF-PDF, which is based on TFPDF algorithm, combined with supervised learning theory for keyword extraction. We compare its performance with TFIDF in comparison, and the results show that our method get better accuracy and recall ratio.
Keywords :
Internet; information retrieval; learning (artificial intelligence); text analysis; Internet; NATF-PDF; TFPDF algorithm; Web public opinion extraction; hot keyword identification; keyword extraction method; semi-supervised learning; supervised learning theory; text clustering method; Clustering; Keyword Extraction; NATF-PDF; Semi-supervised learning;
Conference_Titel :
Pervasive Computing and Applications (ICPCA), 2010 5th International Conference on
Conference_Location :
Maribor
Print_ISBN :
978-1-4244-9144-5
DOI :
10.1109/ICPCA.2010.5704085