Title :
Scalable construction of topic directory with nonparametric closed termset mining
Author :
Yu, Hwanjo ; Searsmith, Duane ; Li, Xiaolei ; Han, Jiawei
Author_Institution :
Dept. of Comput. Sci., Illinois Univ., Urbana, IL, USA
Abstract :
A topic directory, e.g., Yahoo directory, provides a view of a document set at different levels of abstraction and is ideal for the interactive exploration and visualization of the document set. We present a method that dynamically generates a topic directory from a document set using a frequent closed termset mining algorithm. Our method shows experimental results of equal quality to recent document clustering methods and has additional benefits such as automatic generation of topic labels and determination of a clustering parameter.
Keywords :
data mining; document handling; pattern clustering; Yahoo directory; automatic generation; document clustering; hierarchical clustering; nonparametric closed termset mining; topic directory; Clustering algorithms; Clustering methods; Computer science; Data mining; Itemsets; Organizing; Permission; Taxonomy; Tree graphs; Visualization; document clustering; hierarchical clustering; topic directory;
Conference_Titel :
Data Mining, 2004. ICDM '04. Fourth IEEE International Conference on
Print_ISBN :
0-7695-2142-8
DOI :
10.1109/ICDM.2004.10056