Title :
Mining Uncertain Data in Low-dimensional Subspace
Author :
Yu, Zhiwen ; Wong, Hau-San
Author_Institution :
Dept. of Comput. Sci., City Univ. of Hong Kong
Abstract :
Mining for clusters in a database with uncertain data is a hot topic in many application areas, such as sensor database, location database, face recognition system and so on. Since it is commonly assumed that most of the objects which are contained in a high-dimensional dataset are located in a low-dimensional sub-space, mining clusters in a subspace in an uncertain database is a new task. In this paper, we adopt and combine fractal correlation dimension with fuzzy distance function to find out the clusters in a low-dimensional subspace in an uncertain database. We also propose the fuzzy kth NN algorithm to retrieve the kth nearest neighbor which can accelerate the process of mining. The experiments show that the new algorithm works well in an uncertain database
Keywords :
correlation methods; data mining; fractals; fuzzy set theory; pattern clustering; cluster mining; data mining; fractal correlation dimension; fuzzy distance function; fuzzy kth NN algorithm; low-dimensional subspace; uncertain database; Application software; Clustering algorithms; Computer science; Data mining; Databases; Distribution functions; Face recognition; Fractals; Probability density function; Sensor systems and applications;
Conference_Titel :
Pattern Recognition, 2006. ICPR 2006. 18th International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2521-0
DOI :
10.1109/ICPR.2006.801