Title :
Extracting discriminative shapelets from heterogeneous sensor data
Author :
Patri, Om P. ; Sharma, Abhishek B. ; Haifeng Chen ; Guofei Jiang ; Panangadan, Anand V. ; Prasanna, Viktor K.
Author_Institution :
Univ. of Southern California, Los Angeles, CA, USA
Abstract :
We study the problem of identifying discriminative features in Big Data arising from heterogeneous sensors. We highlight the heterogeneity in sensor data from engineering applications and the challenges involved in automatically extracting only the most interesting features from large datasets. We formulate this problem as that of classification of multivariate time series and design shapelet-based algorithms for this task. We design a novel approach, called Shapelet Forests (SF), which combines shapelet extraction with feature selection. We evaluate our proposed method with other approaches for mining shapelets from multivariate time series using data from real-world engineering applications. Quantitative analysis of the experiments shows that SF performs better than the baseline approaches and achieves high classification accuracy. In addition, the method enables identification of noisy sensors from multivariate data and discounts their use for classification.
Keywords :
Big Data; data mining; feature selection; time series; Big Data; SF; Shapelet Forests; discriminative shapelets; feature selection; heterogeneous sensor data; large datasets; multivariate data; multivariate time series; shapelet extraction; shapelet-based algorithms; Data mining; Decision trees; Feature extraction; Kernel; Monitoring; Time series analysis; Training; Feature Selection; Multivariate Data; Shapelet Forests; Time Series Shapelets; mRMR;
Conference_Titel :
Big Data (Big Data), 2014 IEEE International Conference on
Conference_Location :
Washington, DC
DOI :
10.1109/BigData.2014.7004344