DocumentCode :
3055580
Title :
Range queries in natural language dictionaries with recursive lists of clusters
Author :
Mamede, Margarida ; Barbosa, Fernanda
Author_Institution :
UNL, Caparica
fYear :
2007
fDate :
7-9 Nov. 2007
Firstpage :
1
Lastpage :
6
Abstract :
We evaluate the performance of range queries in the Recursive List of Clusters (RLC) metric data structure, when the metric spaces are natural language dictionaries with the Levenshtein distance. The study compares RLC with five data structures (GNAT, H-Dsatl, LAESA, LC, and vp-trees) and comprises six dictionaries. The natural language dictionaries (in English, French, German, Italian, Portuguese, and Spanish), are characterised according to the mean and the variance of the histograms of distances. The experimental results show that RLC has a good performance in all tested cases and, in some of them, it outperforms all the other data structures. In addition, RLC is the only data structure that always keeps its good performance, whether the space dimension is lower or higher, and whether the query radius is smaller or larger.
Keywords :
data structures; dictionaries; natural languages; query processing; metric data structure; natural language dictionaries; range queries; recursive lists of clusters; DNA; Data structures; Dictionaries; Extraterrestrial measurements; Histograms; Image databases; Multimedia databases; Natural languages; Spatial databases; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer and information sciences, 2007. iscis 2007. 22nd international symposium on
Conference_Location :
Ankara
Print_ISBN :
978-1-4244-1363-8
Electronic_ISBN :
978-1-4244-1364-5
Type :
conf
DOI :
10.1109/ISCIS.2007.4456857
Filename :
4456857
Link To Document :
بازگشت