• DocumentCode
    945849
  • Title

    Efficient Similarity Search over Future Stream Time Series

  • Author

    Lian, Xiang ; Chen, Lei

  • Author_Institution
    Hong Kong Univ. of Sci. & Technol., Hong Kong
  • Volume
    20
  • Issue
    1
  • fYear
    2008
  • Firstpage
    40
  • Lastpage
    54
  • Abstract
    With the advance of hardware and communication technologies, stream time series is gaining ever-increasing attention due to its importance in many applications such as financial data processing, network monitoring, Web click-stream analysis, sensor data mining, and anomaly detection. For all of these applications, an efficient and effective similarity search over stream data is essential. Because of the unique characteristics of the stream, for example, data are frequently updated and real-time response is required, the previous approaches proposed for searching through archived data may not work in the stream scenarios. Especially, in the cases where data often arrive periodically for various reasons (for example, the communication congestion or batch processing), queries on such incomplete time series or even future time series may result in inaccuracy using traditional approaches. Therefore, in this paper, we propose three approaches, polynomial, Discrete Fourier Transform (DFT), and probabilistic, to predict the unknown values that have not arrived at the system and answer similarity queries based on the predicted data. We also apply efficient indexes, that is, a multidimensional hash index and a B+-tree, to facilitate the prediction and similarity search on future time series, respectively. Extensive experiments demonstrate the efficiency and effectiveness of our methods for prediction and answering queries.
  • Keywords
    discrete Fourier transforms; polynomials; probability; query processing; time series; Web click-stream analysis; anomaly detection; discrete Fourier transform; financial data processing; future stream time series; multidimensional hash index; network monitoring; polynomial; probabilistic; sensor data mining; similarity search; Information Search and Retrieval; Multimedia databases; Query processing; Search process;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2007.190666
  • Filename
    4358952