• DocumentCode
    3363012
  • Title

    Similarity join for low-and high-dimensional data

  • Author

    Kalashnikov, Dmitri V. ; Prabhakar, Sunil

  • Author_Institution
    CS Dept, Purdue Univ., West Lafayette, IN, USA
  • fYear
    2003
  • fDate
    26-28 March 2003
  • Firstpage
    7
  • Lastpage
    16
  • Abstract
    The efficient processing of similarity joins is important for a large class of applications. The dimensionality of the data for these applications ranges from low to high. Most existing methods have focussed on the execution of high-dimensional joins over large amounts of disk-based data. The increasing sizes of main memory available on current computers, and the need for efficient processing of spatial joins suggest that spatial joins for a large class of problems can be processed in main memory. In this paper we develop two new spatial join algorithms, the Grid-join and EGO-join, and study their performance in comparison to the state of the art algorithm EGO-join and the RSJ algorithm. Through evaluation we explore the domain of applicability of each algorithm and provide recommendations for the choice of join algorithm depending upon the dimensionality of the data as well as the critical /spl epsiv/ parameter. We also point out the significance of the choice of this parameter for ensuring that the selectivity achieved is reasonable.
  • Keywords
    merging; relational algebra; visual databases; EGO-join; Grid-join; RSJ algorithm; art algorithm; dimensionality; disk-based data; high-dimensional data; similarity joins; spatial joins; Algorithm design and analysis; Application software; Data mining; Database systems; Engineering profession; Geographic Information Systems; Multimedia databases; Spatial databases; Time series analysis; Workstations;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Database Systems for Advanced Applications, 2003. (DASFAA 2003). Proceedings. Eighth International Conference on
  • Conference_Location
    Kyoto, Japan
  • Print_ISBN
    0-7695-1895-8
  • Type

    conf

  • DOI
    10.1109/DASFAA.2003.1192363
  • Filename
    1192363