DocumentCode
1784033
Title
Research on Topic Detection Strategy Based on Extension of Comments and HowNet Lexeme
Author
Yun Liu ; Xiaoxian Li ; Bing Zhao
Author_Institution
Dept. of Electron. & Inf. Eng., Beijing Jiaotong Univ., Beijing, China
fYear
2014
fDate
27-29 Aug. 2014
Firstpage
920
Lastpage
923
Abstract
As a product of Web2.0, micro-blog is developing rapidly these years. More and more information spread on the micro-blog because of its high speed and convenience, social hotspots and news events included. As a result, discovering, extraction and analyzing information become researching hotspots. By studying micro-blog text and long text cluster, this article draws a conclusion that traditional cluster algorithms cannot be used to discover topics because of the length of text. Therefore, this article proposes a solution which is based on the extension of the comments and HowNet lexeme. By this method, the short text and diversified expression can be overcome. Finally, the simulation results show that the proposed algorithm would significantly diminish the bad effects which are the results of short-text and improve the accuracy of clustering results.
Keywords
Internet; pattern clustering; social networking (online); text analysis; HowNet Lexeme; Web 2.0; comment extension; microblog text cluster; short-text cluster; social hotspots; topic detection strategy; Accuracy; Clustering algorithms; Data mining; Internet; Probability; Semantics; Vectors; Microblogging short text; clustering algorithm; hot topics;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP), 2014 Tenth International Conference on
Conference_Location
Kitakyushu
Print_ISBN
978-1-4799-5389-9
Type
conf
DOI
10.1109/IIH-MSP.2014.231
Filename
6998477
Link To Document