DocumentCode :
2850110
Title :
SCHISM: a new approach for interesting subspace mining
Author :
Sequeira, Karlton ; Zaki, Mohammed
Author_Institution :
Dept. of Comput. Sci., Rensselaer Polytech. Inst., Troy, NY, USA
fYear :
2004
fDate :
1-4 Nov. 2004
Firstpage :
186
Lastpage :
193
Abstract :
High-dimensional data pose challenges to traditional clustering algorithms due to their inherent sparsity and data tend to cluster in different and possibly overlapping subspaces of the entire feature space. Finding such subspaces is called subspace mining. We present SCHISM, a new algorithm for mining interesting subspaces, using the notions of support and Chernoff-Hoeffding bounds. We use a vertical representation of the dataset, and use a depth-first search with backtracking to find maximal interesting subspaces. We test our algorithm on a number of high-dimensional synthetic and real datasets to test its effectiveness.
Keywords :
backtracking; data mining; pattern clustering; tree searching; Chernoff-Hoeffding bound; SCHISM; backtracking; clustering algorithm; depth-first search; feature space; interesting subspaces; subspace mining; vertical dataset representation; Clustering algorithms; Computer science; Engineering profession; Karhunen-Loeve transforms; Multidimensional systems; Partitioning algorithms; Singular value decomposition; Testing; US Department of Energy; Unsupervised learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2004. ICDM '04. Fourth IEEE International Conference on
Print_ISBN :
0-7695-2142-8
Type :
conf
DOI :
10.1109/ICDM.2004.10099
Filename :
1410283
Link To Document :
بازگشت