Title :
TSA-tree: a wavelet-based approach to improve the efficiency of multi-level surprise and trend queries on time-series data
Author :
Shahabi, Cyrus ; Tian, Xiaoming ; Zhao, Wugang
Author_Institution :
Dept. of Comput. Sci., Univ. of Southern California, Los Angeles, CA, USA
Abstract :
We introduce a novel wavelet-based tree structure, termed the TSA-tree, which improves the efficiency of multi-level trend and surprise queries on time sequence data. With the explosion of scientific observation data conceptualized as time sequences, we face the challenge of efficiently storing, retrieving and analyzing this data. Frequent queries on such data sets seek trends (e.g., global warming) or surprises (e.g., an undersea volcano eruption) within the original time series. The challenge, however, is that these trend and surprise queries are needed at different levels of abstraction. Supporting these multi-level queries sometimes requires retrieving and processing a huge subset of the raw data. To expedite this process, we utilize our TSA-tree. Each node of the TSA-tree contains pre-computed trends and surprises at different levels. A wavelet transform is used recursively to construct TSA nodes. As a result, each node of the TSA-tree is readily available for visualization of trends and surprises. In addition, the size of each node is significantly smaller than that of the original time series, resulting in faster I/O operations. However, a limitation of the TSA-tree is that its size is larger than the original time series. To address this shortcoming, we first prove that the storage space required to store the optimal subtree of the TSA-tree (OTSA-tree) without losing any information is no more than that required to store the original time series. Next, we propose two alternative techniques to reduce the size of the OTSA-tree even further while maintaining an acceptable query precision as compared to querying the original time sequences. Utilizing real and synthetic time sequence databases, we compare our techniques with some well-known algorithms.
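The recursive construction described in the abstract can be illustrated with a minimal sketch. It assumes a Haar wavelet (the abstract does not name the filter) and a series whose length is a power of two. The function names `haar_step`, `build_tsa_tree` and `reconstruct` are hypothetical and are not taken from the paper. Keeping only the deepest trend node plus every surprise node is one plausible reading of the lossless "optimal subtree" and is labeled as an assumption here.

```python
import math

SQRT2 = math.sqrt(2.0)


def haar_step(x):
    """Split an even-length series into a trend node and a surprise node.

    Assumption: orthonormal Haar filter. Each node has half the input length.
    """
    trend = [(x[i] + x[i + 1]) / SQRT2 for i in range(0, len(x), 2)]
    surprise = [(x[i] - x[i + 1]) / SQRT2 for i in range(0, len(x), 2)]
    return trend, surprise


def build_tsa_tree(x, levels):
    """Apply haar_step recursively to the trend of the previous level.

    Returns a list of (trend, surprise) pairs, one pair per level.
    """
    tree = []
    current = list(x)
    for _ in range(levels):
        if len(current) < 2 or len(current) % 2:
            break  # cannot halve any further
        trend, surprise = haar_step(current)
        tree.append((trend, surprise))
        current = trend
    return tree


def reconstruct(tree):
    """Rebuild the original series from the deepest trend and all surprises."""
    current = tree[-1][0]
    for _, surprise in reversed(tree):
        restored = []
        for a, d in zip(current, surprise):
            restored.append((a + d) / SQRT2)
            restored.append((a - d) / SQRT2)
        current = restored
    return current


series = [2.0, 4.0, 6.0, 8.0, 10.0, 12.0, 14.0, 16.0]
tree = build_tsa_tree(series, levels=3)

# Storing every node costs more than the raw series (14 values vs. 8).
full_size = sum(len(t) + len(s) for t, s in tree)

# Assumed lossless subset: deepest trend plus all surprise nodes (8 values).
lossless_size = len(tree[-1][0]) + sum(len(s) for _, s in tree)
```

In this sketch the trend node at level k has 1/2^k of the original length, which is consistent with the abstract's point that reading a single node needs less I/O than reading the raw series, while the full set of nodes is larger than the series itself.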
Keywords :
query processing; scientific information systems; statistical databases; temporal databases; time series; tree data structures; trees (mathematics); wavelet transforms; I/O operations; OTSA-tree; TSA nodes; TSA-tree; data set; multi-level surprise; optimal subtree; pre-computed trends; query precision; raw data retrieval; scientific observation data; storage space; surprise queries; synthetic time sequence databases; time sequence data; time series data; trend queries; wavelet based approach; wavelet based tree structure; wavelet transform; Computer science; Data analysis; Databases; Explosions; Global warming; Land surface temperature; NASA; Satellite ground stations; Tree data structures; Volcanoes;
Conference_Title :
Scientific and Statistical Database Management, 2000. Proceedings. 12th International Conference on
Conference_Location :
Berlin
Print_ISBN :
0-7695-0686-0
DOI :
10.1109/SSDM.2000.869778