DocumentCode
2812299
Title
Analyzing trends by symbolic episode representation and sequence alignment
Author
Balasko, B. ; Banko, Z. ; Abonyi, J.
Author_Institution
Univ. of Pannonia, Veszprem
fYear
2007
fDate
27-29 June 2007
Firstpage
1
Lastpage
6
Abstract
Data analysis is often associated with quantitative techniques because of the large amount of data and easy-to-use statistical tools. Qualitative trend analysis (QTA) techniques always have to be guided with some data reduction method, e.g. principal component analysis (PCA) or segmentation, and the preprocessed, lowered size data can be analyzed for further aims. Derivative-based segmentation methods are presented which are popular in fault diagnosis. If there is an adequate distance measure, one is able to qualify, compare or classify different time series. This article proposes segmentation-based alignment techniques based on dynamic distance measure: time warping (DTW) and a developed one, which uses pairwise sequence alignment -a common tool in bioinformatics -to align triangular episode sequences. Both techniques highly depend on the pre-defined distance or similarity measure between the trends because they try to find the minimal distance or maximal similarity path. These two techniques are compared and qualified on handwriting data based case study. It has been shown that symbolic episode segmentation based sequence alignment aided by prior knowledge of the operators can handle qualitative trend analysis and thus it is able to monitor and qualify operating processes.
Keywords
data analysis; data reduction; distance measurement; principal component analysis; sequences; time series; data analysis; data reduction; derivative-based segmentation methods; dynamic distance measurement; dynamic time warping; pairwise sequence alignment; principal component analysis; qualitative trend analysis; quantitative techniques; segmentation-based alignment techniques; statistical tools; symbolic episode representation; time series; triangular episode sequences alignment; Bioinformatics; DNA; Data analysis; Data mining; Fault diagnosis; Principal component analysis; Sequences; Shape measurement; Time measurement; Time series analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Control & Automation, 2007. MED '07. Mediterranean Conference on
Conference_Location
Athens
Print_ISBN
978-1-4244-1282-2
Electronic_ISBN
978-1-4244-1282-2
Type
conf
DOI
10.1109/MED.2007.4433862
Filename
4433862
Link To Document