Title :
Enhancing Edit Distance on Real Sequences Filters using Histogram Distance on Fixed Reference Ordering
Author :
Chairunnanda, Prima ; Gopalkrishnan, Vivekanand ; Chen, Lei
Author_Institution :
Sch. of Comput. Eng., Nanyang Technol. Univ.
Abstract :
Distance functions are the main tools for measuring the similarity of two sequences and for finding the sequences closest to a given query sequence. Several well-known distance functions, however, have an asymptotic time complexity of O(mn) for sequences of lengths m and n, which systems that handle large volumes of data cannot fully afford. These distance functions, including edit distance on real sequences (EDR) (L. Chen et al., 2005), have pruning methods that reduce execution time by dismissing false candidates as early as possible. In this paper, we propose the histogram distance on fixed reference (HDFR) ordering, with various reference histogram construction methods, to improve the filtering power of the pruning methods in EDR. Experiments show that EDR execution time decreases after HDFR is applied. While we base our experiments on EDR, HDFR can also be applied to other distance functions with appropriate pruning methods.
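To make the O(mn) cost concrete, the following is a minimal sketch of the EDR dynamic program that the pruning methods try to avoid running in full. It is an illustration, not the authors' implementation: it assumes one-dimensional sequences (EDR is typically defined on multi-dimensional trajectory points), and the function name `edr` and the parameter name `eps` (the matching threshold) are chosen here for illustration. HDFR itself is not shown, since the abstract does not specify its construction.

```python
from typing import Sequence


def edr(r: Sequence[float], s: Sequence[float], eps: float) -> int:
    """Edit distance on real sequences for 1-D inputs (illustrative sketch).

    Two elements "match" (substitution cost 0) when they differ by at
    most eps; otherwise a substitution costs 1. Insertions and deletions
    always cost 1. Runs in O(m*n) time and O(n) extra space.
    """
    m, n = len(r), len(s)
    # prev[j] holds the distance between the first i-1 elements of r
    # and the first j elements of s; row 0 is all insertions.
    prev = list(range(n + 1))
    for i in range(1, m + 1):
        curr = [i] + [0] * n  # column 0 is all deletions
        for j in range(1, n + 1):
            subcost = 0 if abs(r[i - 1] - s[j - 1]) <= eps else 1
            curr[j] = min(
                prev[j - 1] + subcost,  # match or substitute
                prev[j] + 1,            # delete r[i-1]
                curr[j - 1] + 1,        # insert s[j-1]
            )
        prev = curr
    return prev[n]


if __name__ == "__main__":
    # One element differs by more than eps, so one substitution is needed.
    print(edr([1.0, 2.0, 3.0], [1.0, 5.0, 3.0], eps=0.5))
```

A filter such as the one described in the abstract would compute a cheaper lower bound first and call a routine like this only for candidates the bound cannot dismiss.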
Keywords :
computational complexity; pattern recognition; sequences; asymptotical time complexity; distance function; edit distance on real sequence; histogram distance on fixed reference ordering; pruning method; query sequence; reference histogram construction; similarity measurement; Computer science; Databases; Enterprise resource planning; Euclidean distance; Filtering; Histograms; Monitoring; Pattern recognition; Power filters; Surveillance;
Conference_Title :
Pattern Recognition, 2006. ICPR 2006. 18th International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2521-0
DOI :
10.1109/ICPR.2006.492