DocumentCode :
1365694
Title :
TextFlow: Towards Better Understanding of Evolving Topics in Text
Author :
Cui, Weiwei ; Liu, Shixia ; Tan, Li ; Shi, Conglei ; Song, Yangqiu ; Gao, Zekai J. ; Qu, Huamin ; Tong, Xin
Author_Institution :
Hong Kong Univ. of Sci. & Technol., Hong Kong, China
Volume :
17
Issue :
12
fYear :
2011
Firstpage :
2412
Lastpage :
2421
Abstract :
Understanding how topics evolve in text data is an important and challenging task. Although much work has been devoted to topic analysis, the study of topic evolution has largely been limited to individual topics. In this paper, we introduce TextFlow, a seamless integration of visualization and topic mining techniques, for analyzing various evolution patterns that emerge from multiple topics. We first extend an existing analysis technique to extract three-level features: the topic evolution trend, the critical event, and the keyword correlation. Then a coherent visualization that consists of three new visual components is designed to convey complex relationships between them. Through interaction, the topic mining model and visualization can communicate with each other to help users refine the analysis result and gain insights into the data progressively. Finally, two case studies are conducted to demonstrate the effectiveness and usefulness of TextFlow in helping users understand the major topic evolution patterns in time-varying text data.
Keywords :
data mining; data visualisation; feature extraction; text analysis; TextFlow; coherent visualization; convey complex relationships; critical event; keyword correlation; three-level feature extraction; topic evolution trend; topic mining model; topic mining technique; topics evolution; visualization technique; Data visualization; Image color analysis; Tag clouds; Text analysis; Critical event.; Hierarchical Dirichlet process; Text visualization; Topic evolution;
fLanguage :
English
Journal_Title :
Visualization and Computer Graphics, IEEE Transactions on
Publisher :
ieee
ISSN :
1077-2626
Type :
jour
DOI :
10.1109/TVCG.2011.239
Filename :
6065008
Link To Document :
بازگشت