DocumentCode :
2191446
Title :
Measuring Similarity for Multidimensional Sequences
Author :
Wang, Hui ; Lin, Zhiwei ; Mcclean, Sally ; Liu, Jun
Author_Institution :
Fac. of Comput. & Eng., Univ. of Ulster, Derry, UK
fYear :
2010
fDate :
13-13 Dec. 2010
Firstpage :
281
Lastpage :
287
Abstract :
Multidimensional sequences are common, and measuring their similarity is a key to any analysis of such data. There is a wealth of similarity measures for sequences in the literature, but most of them are designed for a special type of sequence and later extended to more general types. These extensions are usually ad hoc, and the extended versions may lose the original conceptual interpretation of the measure. In this paper we consider the problem of how to measure similarity for the general type of multidimensional sequences effectively in a conceptually uniform way. We show that the subsequence concept behind longest common subsequence and all common subsequences can be extended from the temporal dimension to the spatial dimension, and we generalize the all common subsequences similarity to multidimensional sequences. The hard problem is how to compute the generalized similarity. We present a theorem that combines the temporal and spatial dimensions in a simple formula. This theorem suggests a dynamic programming algorithm to compute the generalized similarity. A preliminary experiment shows that this similarity produces competitive outcomes. However, this approach counts some subsequences multiple times when a sequence has repeated elements. We present a theorem that allows counting of distinct common subsequences.
Keywords :
data analysis; dynamic programming; sequences; data analysis; dynamic programming algorithm; dynamic time warping; generalized similarity; longest common subsequence; multidimensional sequences; sequence similarity measure; spatial dimension; temporal dimension; all common subsequences; dynamic time warping; multidimensional sequences; similarity; the longest common subsequence;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshops (ICDMW), 2010 IEEE International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-1-4244-9244-2
Electronic_ISBN :
978-0-7695-4257-7
Type :
conf
DOI :
10.1109/ICDMW.2010.202
Filename :
5693311
Link To Document :
بازگشت