DocumentCode
2382955
Title
Hot keyword identification for extracting web public opinion
Author
Fang, Zhiqi ; Ning, Yue ; Zhu, Tingshao
Author_Institution
Nat. Comput. Syst. Eng. Res. Inst. of China, Beijing, China
fYear
2010
fDate
1-3 Dec. 2010
Firstpage
116
Lastpage
121
Abstract
Internet is becoming an increasingly important platform for ordinary life and work. It is expected that keyword extraction can help people quickly find hot spots on the web, since keywords in a document provide important information about the content of the document. In this paper, we propose to use text clustering method based on semi-supervised learning to get focuses of social topics in a large amount of text. We develop a novel keyword extraction method named NATF-PDF, which is based on TFPDF algorithm, combined with supervised learning theory for keyword extraction. We compare its performance with TFIDF in comparison, and the results show that our method get better accuracy and recall ratio.
Keywords
Internet; information retrieval; learning (artificial intelligence); text analysis; Internet; NATF-PDF; TFPDF algorithm; Web public opinion extraction; hot keyword identification; keyword extraction method; semi-supervised learning; supervised learning theory; text clustering method; Clustering; Keyword Extraction; NATF-PDF; Semi-supervised learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Pervasive Computing and Applications (ICPCA), 2010 5th International Conference on
Conference_Location
Maribor
Print_ISBN
978-1-4244-9144-5
Type
conf
DOI
10.1109/ICPCA.2010.5704085
Filename
5704085
Link To Document