DocumentCode :
479436
Title :
An Investigation on Multi-token List Based Proximity Search in Multi-dimensional Massive Database
Author :
Shen, Haiying ; Li, Ze ; Li, Ting
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of Arkansas, Fayetteville, AR
Volume :
1
fYear :
2008
fDate :
11-13 Nov. 2008
Firstpage :
593
Lastpage :
598
Abstract :
A proximity search looks for similar complex documents such as images, sounds, DNA sequences that share two or more separately matching terms within a specified distance from within a large collection. Retrieving those similar complex documents are of great importance to many applications. To achieve an efficiency query process, many different access methods have been proposed. Token list based proximity search has been proved to be a good alternative method to the LSH for a large massive database proximity search. However, single-token based method leads to a high overhead in the results refinements process to achieve a required similarity. In this paper, we investigate how the multi-token list affects the performance of database proximity search. Numerous experiments have been conducted and the results show that two-token adjacent token list can achieve the best query performance in multi-token list based proximity search.
Keywords :
query formulation; query processing; information retrieval; multidimensional massive database; multitoken list; proximity search; query process; Acoustical engineering; Computer science; DNA; Data engineering; Image databases; Information technology; Multidimensional systems; Partitioning algorithms; Sequences; Speech processing; Massive database; Proximity Search; Tokenlist;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Convergence and Hybrid Information Technology, 2008. ICCIT '08. Third International Conference on
Conference_Location :
Busan
Print_ISBN :
978-0-7695-3407-7
Type :
conf
DOI :
10.1109/ICCIT.2008.277
Filename :
4682091
Link To Document :
بازگشت