Title :
A Data Reusing Strategy Based on Column-Stores
Author :
Mei Wang ; Jiaoling Zhou ; Yue Li ; Xiaoling Xia ; Jiajin Le
Author_Institution :
Donghua Univ., Shanghai, China
Abstract :
Data reusing is an important way to save storage capacity and improve query efficiency in the management of massive data. The column-store architecture stores data from the same column continuously, which greatly improves the performance of ´read optimization´ application and moreover increases the feasibility and flexibility of data reusing. In this paper, we propose a novel reusing method based on the column-store data warehouse. Firstly, we propose an improved iMAP method based on the schema mapping technique to generate as more candidate reusable columns as possible and then conduct further filter on these candidate data, which greatly reduces the complexity of reusable data detection. Based on the column-store architecture, we then propose the reuse implement at the storage layer. The method for query execution based on reusable data is provided finally. The experiment results conducted on the real data sets indicate that the presented strategy can reduce the storage space and query execution time efficiently.
Keywords :
data handling; data warehouses; query processing; storage management; column-store architecture; column-store data warehouse; data reusing strategy; iMAP method; massive data management; query efficiency; query execution method; query execution time; read optimization application; reusable data detection; schema mapping technique; storage capacity; storage space; Cities and towns; Data warehouses; Databases; Feature extraction; Machinery; Redundancy; Vectors; column-store; data reusing; massive data; schema mapping;
Conference_Titel :
Dependable, Autonomic and Secure Computing (DASC), 2013 IEEE 11th International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4799-3380-8
DOI :
10.1109/DASC.2013.56