DocumentCode :
2064866
Title :
Hot topic detection in Chinese web forum using statistics approach
Author :
Li, Xiaoyu ; Dai, Guanzhong ; Lai, Shuang ; Dai, Hang
Author_Institution :
Coll. of Autom., Northwestern Polytech. Univ., Xian, China
fYear :
2011
fDate :
14-16 Sept. 2011
Firstpage :
1
Lastpage :
4
Abstract :
In this paper we propose a statistics approach for hot topic detection in Chinese web forum. In order to solve the fundamental obstacles of Chinese web data mining, such as new words, nonstandard syntax and Chinese word segmentation, we present the longest common segmented consecutive subsequence (LCSCS) and other techniques. The algorithm can run even without prior knowledge. Our experiments show the satisfying results both in performance and quality.
Keywords :
Web sites; data mining; statistical analysis; Chinese Web data mining; Chinese Web forum; hot topic detection; longest common segmented consecutive subsequence; statistics approach; Association rules; Complexity theory; Conferences; Heuristic algorithms; Syntactics; Writing; Chinese web forum; hot topic detection; web data mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing, Communications and Computing (ICSPCC), 2011 IEEE International Conference on
Conference_Location :
Xi´an
Print_ISBN :
978-1-4577-0893-0
Type :
conf
DOI :
10.1109/ICSPCC.2011.6061621
Filename :
6061621
Link To Document :
بازگشت