DocumentCode :
1396006
Title :
Querying time series data based on similarity
Author :
Rafiei, Davood ; Mendelzon, Alberto O.
Author_Institution :
Dept. of Comput. Sci., Alberta Univ., Edmonton, Alta., Canada
Volume :
12
Issue :
5
fYear :
2000
Firstpage :
675
Lastpage :
693
Abstract :
We study similarity queries for time series data where similarity is defined, in a fairly general way, in terms of a distance function and a set of affine transformations on the Fourier series representation of a sequence. We identify a safe set of transformations supporting a wide variety of comparisons and show that this set is rich enough to formulate operations such as moving average and time scaling. We also show that queries expressed using safe transformations can efficiently be computed without prior knowledge of the transformations. We present a query processing algorithm that uses the underlying multidimensional index built over the data set to efficiently answer similarity queries. Our experiments show that the performance of this algorithm is competitive to that of processing ordinary (exact match) queries using the index, and much faster than sequential scanning. We propose a generalization of this algorithm for simultaneously handling multiple transformations at a time, and give experimental results on the performance of the generalized algorithm
Keywords :
Fourier series; data mining; data warehouses; query processing; time series; Fourier series representation; affine transformations; distance function; moving average; performance; query processing algorithm; similarity queries; time scaling; time series data query; Data mining; Databases; Euclidean distance; Fluctuations; Fourier series; Fourier transforms; Indexing; Information retrieval; Multidimensional systems; Query processing;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/69.877502
Filename :
877502
Link To Document :
بازگشت