• DocumentCode
    1827643
  • Title

    Data Independent Method of Constructing Distributed LSH for Large-Scale Dynamic High-Dimensional Indexing

  • Author

    Gu, Xiaoguang ; Zhang, Lei ; Zhang, Dongming ; Zhang, Yongdong ; Li, Jintao ; Bao, Ning

  • Author_Institution
    Inst. of Comput. Technol., Beijing, China
  • fYear
    2012
  • fDate
    25-27 June 2012
  • Firstpage
    564
  • Lastpage
    571
  • Abstract
    Constructing effective and efficient indexes for explosively growing multimedia data is a very challenging problem. To address it, Haghani et al. proposed a distributed similarity search method for high-dimensional data using Locality Sensitive Hashing (LSH). However, their method must estimate a global parameter over the whole dataset beforehand, which is impractical for a large-scale dynamic dataset. This paper proposes a novel method of constructing distributed LSH that does not need any a priori knowledge of the dataset. By generating hash functions with a consistent output distribution, we obtain a theoretical, data-independent prediction model that guarantees good load balance even if the dataset changes dynamically. Furthermore, we modify the query algorithm of basic LSH to make the proposed model more practical. Experimental results on two open large-scale high-dimensional datasets show that the proposed method is more robust, scalable, and practical than the state of the art.
  • Keywords
    file organisation; indexing; multimedia computing; query processing; resource allocation; search problems; data independent predicting model; distributed LSH construction; distributed similarity search method; global parameter estimation; hash function; indexing; load balance; locality sensitive hashing; multimedia data; query algorithm; Data models; Distributed databases; Gaussian distribution; Indexes; Load modeling; Peer to peer computing; Predictive models; Data Independent; Distributed Similarity Search; Locality Sensitive Hashing; Peer-to-Peer;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    2012 IEEE 14th International Conference on High Performance Computing and Communication & 2012 IEEE 9th International Conference on Embedded Software and Systems (HPCC-ICESS)
  • Conference_Location
    Liverpool
  • Print_ISBN
    978-1-4673-2164-8
  • Type

    conf

  • DOI
    10.1109/HPCC.2012.82
  • Filename
    6332221