DocumentCode :
1901263
Title :
A Topic Detection and Tracking Method Combining NLP with Suffix Tree Clustering
Author :
Jin, Yaohong
Author_Institution :
Inst. of Chinese Inf. Process., Beijing Normal Univ., Beijing, China
Volume :
3
fYear :
2012
fDate :
23-25 March 2012
Firstpage :
227
Lastpage :
230
Abstract :
A topic detection and tracking method combining semantic analysis with Suffix Tree Clustering (STC) algorithm is presented. A feature selection using NLP algorithm was introduced to select the noun, verb and name entity as the input of STC. Focusing on the topic drifting, we formed the VSM of cluster by the key words extracted from the nodes of suffix tree by mutual information algorithm. After the similarity computing of clusters and topic detection and tracking, a semantic analysis was introduced to filter the words with same meaning and analyze the semantic structure of words in label of cluster. Finally a content-relevant description was generated for each topic. The experiments showed that this method can detect and track the topics from the news articles effectively.
Keywords :
information analysis; pattern clustering; trees (mathematics); NLP algorithm; VSM; cluster similarity computing; content-relevant description; key words; mutual information algorithm; news articles; semantic analysis; suffix tree clustering; topic detection method; topic tracking method; Algorithm design and analysis; Clustering algorithms; Feature extraction; Information processing; Mutual information; Semantics; Vectors; STC; cluster; mutual information; semantic analysis; topic detection and tacking;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Electronics Engineering (ICCSEE), 2012 International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4673-0689-8
Type :
conf
DOI :
10.1109/ICCSEE.2012.131
Filename :
6188132
Link To Document :
بازگشت