Title :
The PU-Tree: A Partition-Based Uncertain High-Dimensional Indexing Algorithm
Author_Institution :
Coll. of Comput. & Inf. Eng., Zhejiang Gongshang Univ., Hangzhou, China
Abstract :
This paper proposes a partition-based uncertain high-dimensional indexing algorithm, called PU-Tree. In the PU-Tree, all (n)data objects are first grouped into some clusters by a k-Means clustering algorithm. Then each object´s corresponding uncertain sphere is partitioned into several slices in terms of the zero-distance. Finally a unified key of each data object is computed by adopting multi-attribute encoding scheme, which are inserted by a B+-tree. Thus, given a query object, its probabilistic range search in high-dimensional spaces is transformed into the search in the single dimensional space with the aid of the PU-Tree. Extensive performance studies are conducted to evaluate the effectiveness and efficiency of the proposed scheme.
Keywords :
indexing; pattern clustering; query processing; trees (mathematics); B+-tree; PU-tree; k-means clustering algorithm; multi-attribute encoding scheme; partition-based uncertain high-dimensional indexing algorithm; probabilistic range search; Clustering algorithms; Encoding; Indexing; Partitioning algorithms; Probabilistic logic; high-dimensional indexing; probabilistic query;
Conference_Titel :
Network and System Security (NSS), 2010 4th International Conference on
Conference_Location :
Melbourne, VIC
Print_ISBN :
978-1-4244-8484-3
Electronic_ISBN :
978-0-7695-4159-4
DOI :
10.1109/NSS.2010.60