Title :
Surrogate join for massive data on tertiary storage system
Author :
Liu, Baoliang ; Li, Jianzhong ; Zhang, Yanqiu
Author_Institution :
Harbin Inst. of Technol., Heilongjiang, China
Abstract :
In this work, a surrogate join (SJ) for massive data on tertiary storage is presented. The relations to be joined are first split into surrogate relations and nonsurrogate relations. A surrogate relation consists of the tuple identifier and the join attribute, and a nonsurrogate relation consists of the tuple identifier and the nonjoin attributes. The join is first performed on the two surrogate relations, producing a join result index that consists of the identifiers of the matching tuples of both surrogate relations. The join result index is then merged with both nonsurrogate relations to obtain the final join result. Experimental results show that our method is better than previous ones in performance and scalability. Note that, for most applications, SJ can convert a tertiary join into a disk join plus a one-pass scan of both tertiary-resident nonsurrogate relations.
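The three steps named in the abstract (split, join the surrogates, merge the index with the nonsurrogates) can be illustrated with a minimal in-memory sketch. This is not the authors' implementation: the function names `split`, `join_surrogates`, and `merge` are hypothetical, a hash join is assumed for the surrogate step, tuple identifiers are assumed to be row positions, and Python lists and dicts stand in for disk and tertiary storage.

```python
from collections import defaultdict


def split(relation, join_attr):
    """Split a relation (list of dict rows) into a surrogate relation
    (tuple id, join attribute) and a nonsurrogate relation
    (tuple id -> nonjoin attributes). Row position serves as the tuple id."""
    surrogate = [(tid, row[join_attr]) for tid, row in enumerate(relation)]
    nonsurrogate = {
        tid: {k: v for k, v in row.items() if k != join_attr}
        for tid, row in enumerate(relation)
    }
    return surrogate, nonsurrogate


def join_surrogates(sur_r, sur_s):
    """Join the two small surrogate relations (assumed here: a hash join).
    Returns the join result index: pairs of matching tuple ids."""
    buckets = defaultdict(list)
    for r_tid, key in sur_r:
        buckets[key].append(r_tid)
    return [
        (r_tid, s_tid)
        for s_tid, key in sur_s
        for r_tid in buckets.get(key, [])
    ]


def merge(index, non_r, non_s):
    """Merge the join result index with both nonsurrogate relations.
    In this sketch the output rows carry only the nonjoin attributes."""
    return [{**non_r[r_tid], **non_s[s_tid]} for r_tid, s_tid in index]


# Illustrative data only; column names "k", "a", "b" are made up.
R = [{"k": 1, "a": "x"}, {"k": 2, "a": "y"}]
S = [{"k": 2, "b": "p"}, {"k": 3, "b": "q"}, {"k": 2, "b": "r"}]

sur_r, non_r = split(R, "k")
sur_s, non_s = split(S, "k")
index = join_surrogates(sur_r, sur_s)
result = merge(index, non_r, non_s)
```

Only the surrogate relations take part in the join itself, which is why they can plausibly fit on disk even when the full relations cannot; the nonsurrogate relations are touched only in the final merge.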
Keywords :
database indexing; disk join; join attribute; join result index; nonjoin attribute; nonsurrogate relations; surrogate join; surrogate relations; tertiary join; tuple identifier; Costs; Database systems; Earth; Magnetic devices; Magnetic switching; Memory; Mobile handsets; Random media; Scalability; Switches;
Conference_Title :
Database Engineering and Applications Symposium, 2004. IDEAS '04. Proceedings. International
Print_ISBN :
0-7695-2168-1
DOI :
10.1109/IDEAS.2004.1319800