Title :
Self-Organizing Maps for Topic Trend Discovery
Author :
Rzeszutek, Richard ; Androutsos, Dimitrios ; Kyan, Matthew
Author_Institution :
Dept. of Electr. & Comput. Eng., Ryerson Univ., Toronto, ON, Canada
fDate :
6/1/2010 12:00:00 AM
Abstract :
The large volume of data on the Internet makes it extremely difficult to extract high-level information, such as recurring or time-varying trends in document content. Dimensionality reduction techniques can be applied to simplify the analysis process but the amount of data is still quite large. If the analysis is restricted to just text documents then Latent Dirichlet Allocation (LDA) can be used to quantify semantic, or topical, groupings in the data set. This paper proposes a method that combines LDA with the visualization capabilities of Self-Organizing Maps to track topic trends over time. By examining the response of a map over time, it is possible to build a detailed picture of how the contents of a dataset change.
Keywords :
data analysis; data visualisation; information retrieval; moving average processes; self-organising feature maps; Internet; Latent Dirichlet Allocation; data analysis process; data visualization; dimensionality reduction techniques; information extraction; moving average; self-organizing maps; semantic quantification; topic trend discovery; Data processing; self-organizing feature maps; statistical analysis; time-series analysis; topic trending;
Journal_Title :
Signal Processing Letters, IEEE
DOI :
10.1109/LSP.2010.2048940