DocumentCode :
2753805
Title :
Cluster-Based Similarity Search in Time Series
Author :
Karamitopoulos, Leonidas ; Evangelidis, Georgios
Author_Institution :
Dept. of Appl. Inf., Univ. of Macedonia Thessaloniki, Thessaloniki, Greece
fYear :
2009
fDate :
17-19 Sept. 2009
Firstpage :
113
Lastpage :
118
Abstract :
In this paper, we present a new method that accelerates similarity search implemented via one-nearest neighbor on time series data. The main idea is to identify the most similar time series to a given query without necessarily searching over the whole database. Our method is based on partitioning the search space by applying the K-means algorithm on the data. Then, similarity search is performed hierarchically starting from the cluster that lies most closely to the query. This procedure aims at reaching the most similar series without searching all clusters. In this work, we propose to reduce the intrinsically high dimensionality of time series prior to clustering by applying a well known dimensionality reduction technique, namely, the piecewise aggregate approximation, for its simplicity and efficiency. Experiments are conducted on twelve real-world and synthetic datasets covering a wide range of applications.
Keywords :
approximation theory; data mining; information retrieval; pattern clustering; time series; K-means algorithm; cluster-based similarity search; data mining; dimensionality reduction technique; one-nearest neighbor; piecewise aggregate approximation; time series data; Aggregates; Data mining; Databases; Degradation; Discrete Fourier transforms; Indexing; Informatics; Information retrieval; Multidimensional systems; Nearest neighbor searches; clustering; data mining; similarity search; time series;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Informatics, 2009. BCI '09. Fourth Balkan Conference in
Conference_Location :
Thessaloniki
Print_ISBN :
978-0-7695-3783-2
Type :
conf
DOI :
10.1109/BCI.2009.22
Filename :
5359309
Link To Document :
بازگشت