• DocumentCode
    1343947
  • Title

    Efficient Routing of Subspace Skyline Queries over Highly Distributed Data

  • Author

    Vlachou, Akrivi ; Doulkeridis, Christos ; Kotidis, Yannis ; Vazirgiannis, Michalis

  • Author_Institution
    Dept. of Comput. & Inf. Sci. (IDI), Norwegian Univ. of Sci. & Technol. (NTNU), Trondheim, Norway
  • Volume
    22
  • Issue
    12
  • fYear
    2010
  • Firstpage
    1694
  • Lastpage
    1708
  • Abstract
    Data generation increases at highly dynamic rates, making its storage, processing, and update costs at one central location excessive. The P2P paradigm emerges as a powerful model for organizing and searching large data repositories distributed over independent sources. Advanced query operators, such as skyline queries, are necessary in order to help users handle the huge amount of available data. A skyline query retrieves the set of nondominated data points in a multidimensional data set. Skyline query processing in P2P networks poses inherent challenges and demands nontraditional techniques, due to the distribution of content and the lack of global knowledge. Relying on a superpeer architecture, we propose a threshold-based algorithm, called SKYPEER and its variants, for efficient computation of skyline points in arbitrary subspaces, while reducing both computational time and volume of transmitted data. Furthermore, we address the problem of routing skyline queries over the superpeer network and we propose an efficient routing mechanism, namely SKYPEER+, which further improves the performance by reducing the number of contacted superpeers. Finally, we provide an extensive experimental evaluation showing that our approach performs efficiently and provides a viable solution when a large degree of distribution is required.
  • Keywords
    distributed databases; peer-to-peer computing; query processing; SKYPEER algorithm; data generation; distributed data; peer-to-peer networks; skyline query processing; subspace skyline query routing; superpeer architecture; Computer architecture; Costs; Distributed computing; Distributed power generation; Information retrieval; Information systems; Network servers; Organizing; Query processing; Routing; Skyline queries; peer-to-peer systems; routing indexes.;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2009.204
  • Filename
    5342419