DocumentCode :
1796761
Title :
Exploring the Use of Diverse Replicas for Big Location Tracking Data
Author :
Ye Ding ; Haoyu Tan ; Wuman Luo ; Lionel M. Ni
Author_Institution :
Dept. of Comput. Sci. & Eng., Hong Kong Univ. of Sci. & Technol., Hong Kong, China
fYear :
2014
fDate :
June 30 2014-July 3 2014
Firstpage :
83
Lastpage :
92
Abstract :
The value of large amounts of location tracking data has received wide attention in many applications, including human behavior analysis, urban transportation planning, and various location-based services (LBS). Nowadays, both scientific and industrial communities are encouraged to collect as much location tracking data as possible, which brings about two issues: 1) it is challenging to process queries on big location tracking data efficiently, and 2) it is expensive to store several exact data replicas for fault tolerance. So far, several dedicated storage systems have been proposed to address these issues. However, they do not work well when the query ranges vary widely. In this paper, we present the design of a storage system using a diverse replica scheme, which improves query processing efficiency while reducing the cost of storage space. To the best of our knowledge, we are the first to investigate data storage and processing in the context of big location tracking data. Specifically, we conduct an in-depth theoretical and empirical analysis of the trade-offs between different spatio-temporal partitioning schemes as well as data encoding schemes. We then propose an effective approach to select an appropriate set of diverse replicas, which is optimized for the expected query loads while conforming to the given storage space budget. The experimental results confirm that using diverse replicas can significantly improve the overall query performance. The results also demonstrate that the proposed algorithms for the replica selection problem are both effective and efficient.
Keywords :
query processing; replicated databases; storage management; LBS; big location tracking data; data encoding schemes; data processing; data replicas; data storage; dedicated storage systems; diverse replica scheme; diverse replicas; fault-tolerance; human behavior analysis; industrial communities; location-based services; query loads; query performance; query processing efficiency; replica selection problem; scientific communities; spatio-temporal partitioning schemes; storage space budget; storage space cost reduction; storage system design; urban transportation planning; Algorithm design and analysis; Big data; Context; Encoding; Linear programming; Organizations; Query processing;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Distributed Computing Systems (ICDCS), 2014 IEEE 34th International Conference on
Conference_Location :
Madrid
ISSN :
1063-6927
Print_ISBN :
978-1-4799-5168-0
Type :
conf
DOI :
10.1109/ICDCS.2014.17
Filename :
6888885