Title :
Visualizing sequential patterns for text mining
Author :
Wong, Pak Chung ; Cowley, Wendy ; Foote, Harlan ; Jurrus, Elizabeth ; Thomas, Jim
Author_Institution :
Pacific Northwest Lab., Richland, WA, USA
Abstract :
A sequential pattern in data mining is a finite series of elements such as A→B→C→D where A, B, C, and D are elements of the same domain. The mining of sequential patterns is designed to find patterns of discrete events that frequently happen in the same arrangement along a timeline. Like association and clustering, the mining of sequential patterns is among the most popular knowledge discovery techniques that apply statistical measures to extract useful information from large datasets. As out computers become more powerful, we are able to mine bigger datasets and obtain hundreds of thousands of sequential patterns in full detail. With this vast amount of data, we argue that neither data mining nor visualization by itself can manage the information and reflect the knowledge effectively. Subsequently, we apply visualization to augment data mining in a study of sequential patterns in large text corpora. The result shows that we can learn more and more quickly in an integrated visual data-mining environment
Keywords :
data mining; data visualisation; pattern recognition; very large databases; data mining; knowledge discovery; large datasets; large text corpora; sequential pattern visualization; statistical measures; text mining; Association rules; Data mining; Data visualization; Event detection; Laboratories; Motion pictures; Ores; Read only memory; Statistics; Text mining;
Conference_Titel :
Information Visualization, 2000. InfoVis 2000. IEEE Symposium on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7695-0804-9
DOI :
10.1109/INFVIS.2000.885097