• DocumentCode
    606060
  • Title

    Enhanced symbolic aggregate approximation method for financial time series data representation

  • Author

    Barnaghi, P.M. ; Bakar, Afarulrazi Abu ; Othman, Zulaiha Ali

  • Author_Institution
    Data Min. & Optimization Res. Group, Univ. Kebangsaan Malaysia (UKM), Bangi, Malaysia
  • fYear
    2012
  • fDate
    23-25 Oct. 2012
  • Firstpage
    790
  • Lastpage
    795
  • Abstract
    Data representation is one of the most important tasks in time series data pre-processing. Time series data representation is required to make the data more suitable for data mining specifically for prediction. Time series data is characterized by its numerical and continuous values. One of the data representation methods for time series is the Symbolic Aggregate Approximation (SAX) which uses mean values as the basis of representation of the data. However. representing the time series financial data with the mean value often causes the loss of patterns that can describes important pieces of information. The aim of this study is to propose an enhancement of SAX representation purposely for the financial time series data. The Enhanced SAX (EN-SAX) adds two new values to the original mean value for each segment in SAX. These values enable better representation for each segment in a lower dimension and keep some of the important patterns that are meaningful in financial time series data. The experimental results show that the EN-SAX representation manages to give lower error rates compared to SAX and improves the prediction accuracy.
  • Keywords
    approximation theory; data mining; data structures; financial data processing; time series; EN-SAX representation; SAX representation enhancement; continuous values; data mining; enhanced symbolic aggregate approximation method; error rates; financial time series data representation; numerical values; prediction accuracy; time series data pre-processing; Financial time series data; dimensionality reduction; symbolic aggregate approximation (SAX); time series data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Science and Service Science and Data Mining (ISSDM), 2012 6th International Conference on New Trends in
  • Conference_Location
    Taipei
  • Print_ISBN
    978-1-4673-0876-2
  • Type

    conf

  • Filename
    6528740