DocumentCode :
2382955
Title :
Hot keyword identification for extracting web public opinion
Author :
Fang, Zhiqi ; Ning, Yue ; Zhu, Tingshao
Author_Institution :
Nat. Comput. Syst. Eng. Res. Inst. of China, Beijing, China
fYear :
2010
fDate :
1-3 Dec. 2010
Firstpage :
116
Lastpage :
121
Abstract :
Internet is becoming an increasingly important platform for ordinary life and work. It is expected that keyword extraction can help people quickly find hot spots on the web, since keywords in a document provide important information about the content of the document. In this paper, we propose to use text clustering method based on semi-supervised learning to get focuses of social topics in a large amount of text. We develop a novel keyword extraction method named NATF-PDF, which is based on TFPDF algorithm, combined with supervised learning theory for keyword extraction. We compare its performance with TFIDF in comparison, and the results show that our method get better accuracy and recall ratio.
Keywords :
Internet; information retrieval; learning (artificial intelligence); text analysis; Internet; NATF-PDF; TFPDF algorithm; Web public opinion extraction; hot keyword identification; keyword extraction method; semi-supervised learning; supervised learning theory; text clustering method; Clustering; Keyword Extraction; NATF-PDF; Semi-supervised learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pervasive Computing and Applications (ICPCA), 2010 5th International Conference on
Conference_Location :
Maribor
Print_ISBN :
978-1-4244-9144-5
Type :
conf
DOI :
10.1109/ICPCA.2010.5704085
Filename :
5704085
Link To Document :
بازگشت