• DocumentCode
    517719
  • Title

    A Novel Similarity Measure Framework on Financial Data Mining

  • Author

    Fangwen, Zhai ; Zehong, Yang ; Yixu, Song ; Yi, Liu

  • Author_Institution
    State Key Lab. on Intell. Technol., Tsinghua Univ., Beijing, China
  • Volume
    1
  • fYear
    2010
  • fDate
    24-25 April 2010
  • Firstpage
    505
  • Lastpage
    508
  • Abstract
    Similarity measure is one of the core problems in data mining research on time series. Both Euclidean Distance and Dynamic Time Warping methods are widely used in similarity measure, which have their own limitations. Based on the symbolic time series data, this paper creatively brought forward a new kind of similarity measure framework SFVM (SAX Feature Vector Model). Based on the architecture of SFVM, we introduced three feature extraction methods, which can be extended as three new similarity measure methods. The paper obtained stock data from the Standard & Poor´s 500 index, and compared the behavior of several similarity measure methods in clustering research. The experiment results showed that the three methods could efficiently measure the similarity according to the whole trend by comparing with the classic methods.
  • Keywords
    data mining; feature extraction; financial data processing; time series; SAX feature vector model framework; dynamic time warping methods; euclidean distance; feature extraction methods; financial data mining; stock data; symbolic time series data; Communication system security; Computer security; Data analysis; Data mining; Data security; Euclidean distance; Feature extraction; Laboratories; Time measurement; Time series analysis; financial data mining; similarity measure; symbolic method; time series;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Networks Security Wireless Communications and Trusted Computing (NSWCTC), 2010 Second International Conference on
  • Conference_Location
    Wuhan, Hubei
  • Print_ISBN
    978-0-7695-4011-5
  • Electronic_ISBN
    978-1-4244-6598-9
  • Type

    conf

  • DOI
    10.1109/NSWCTC.2010.126
  • Filename
    5480794