• DocumentCode
    1388319
  • Title

    On Group Nearest Group Query Processing

  • Author

    Deng, Ke ; Sadiq, Shazia ; Zhou, Xiaofang ; Xu, Hu ; Fung, Gabriel Pui Cheong ; Lu, Yansheng

  • Author_Institution
    Sch. of Inf. Technol. & Electr. Eng., Univ. of Queensland, Brisbane, QLD, Australia
  • Volume
    24
  • Issue
    2
  • fYear
    2012
  • Firstpage
    295
  • Lastpage
    308
  • Abstract
    Given a data point set D, a query point set Q, and an integer k, the Group Nearest Group (GNG) query finds a subset ω (|ω| ≤ k)of points from Dsuch that the total distance from all points in Q to the nearest point in ω is not greater than any other subset ω´ (|ω´| ≤ k) of points in D. GNG query is a partition-based clustering problem which can be found in many real applications and is NP-hard. In this paper, Exhaustive Hierarchical Combination (EHC) algorithm and Subset Hierarchial Refinement (SHR) algorithm are developed for GNG query processing. While EHC is capable to provide the optimal solution for k = 2, SHR is an efficient approximate approach that combines database techniques with local search heuristic. The processing focus of our approaches is on minimizing the access and evaluation of subsets of cardinality k in D since the number of such subsets is exponentially greater than |D|. To do that, the hierarchical blocks of data points at high level are used to find an intermediate solution and then refined by following the guided search direction at low level so as to prune irrelevant subsets. The comprehensive experiments on both real and synthetic data sets demonstrate the superiority of SHR in terms of efficiency and quality.
  • Keywords
    query processing; set theory; EHC algorithm; SHR algorithm; data point set; exhaustive hierarchical combination algorithm; group nearest group query processing; local search heuristic; query point set; subset hierarchial refinement algorithm; Clustering algorithms; Knowledge engineering; Nearest neighbor searches; Optimization; Partitioning algorithms; Query processing; K-median clustering; group nearest group query; group nearest neighbor query.;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2010.230
  • Filename
    5645619