Title :
Novel
-Approximation to Data Streams in Sensor Networks
Author :
Jianzhong Li ; Guohua Li ; Hong Gao
Author_Institution :
Dept. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin, China
Abstract :
We are gradually moving into a realm where sensors, processors, memory and wireless transceivers would be seamlessly integrated together in the physical world and form a wireless sensor network. Such networks pose new challenges in data processing and transmission due to the characteristic of limited communication bandwidth and other resource constraints of sensor networks. To reduce the cost of storage, transmission and processing of time series data generated by sensor nodes, the need for more compact representations of time series data is compelling. Although a large number of data compression algorithms have been proposed to reduce data volume, their offline characteristic or super-linear time complexity prevents them from being applied directly on time series data generated by sensor nodes. Motivated by these observations, we propose an optimal online algorithm GDPLA for constructing a disconnected piecewise linear approximation of a time series which guarantees that the vertical distance between each real data point and the corresponding fit line is less than or equal to ". GDPLA not only generates the minimum number of segments to approximate a time series with precision guarantee, but also only requires linear time O(n) bounded by a constant coefficient 6, where unit 1 denotes the time complexity of comparing the slopes of two lines. The low cost characteristic of our method makes it a proper choice for resource-constrained WSNs. Extensive experiments on two real data sets have been conducted to demonstrate the superior compression performance of our method.
Keywords :
approximation theory; communication complexity; data communication; data compression; piecewise linear techniques; time series; wireless sensor networks; GDPLA; constant coefficient; data compression algorithm; data point; data processing; data stream; data transmission; disconnected piecewise linear approximation; e-approximation; limited communication bandwidth; optimal online algorithm; resource constrained WSN; sensor node; super-linear time complexity; time series data generation; time series data representation; wireless sensor network; Approximation algorithms; Linear approximation; Piecewise linear approximation; Programmable logic arrays; Time series analysis; Wireless sensor networks; ??-approximation; Sensor networks; data streams; piecewise linear approximation;
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
DOI :
10.1109/TPDS.2014.2323056