Title :
Column-based RLE in row-oriented database
Author_Institution :
Key Lab. of Comput. Syst. & Archit., Grad. Univ. of Chinese Acad. of Sci., Beijing, China
Abstract :
In database systems, disk I/O performance is usually the bottleneck of the whole query processing. Among many techniques, compression is one of the most important ones to reduce disk accesses so to improve system performance. RLE (run-length encoding) is one light-weight compression algorithm which incurs negligible CPU cost. A lot of work show that, although RLE is one of the most effective compression techniques in column-oriented systems, it is very hard to use due to bad value locality in row-oriented systems where values from multiple attributes are stored in the same page. We propose CRLE (Column-based RLE), one compression algorithm to apply RLE to row-oriented data storage. On row-oriented storage page, CRLE can exploit value locality in individual column and encode values from the same column in run-length format. Experiments show that CRLE can lead to very good compression ratio and performance in spite of row-oriented data storage.
Keywords :
database management systems; query processing; column based run-length encoding; disk I-O performance; query processing; row-oriented data storage; row-oriented database systems; Compression algorithms; Computer architecture; Costs; Database systems; Encoding; Engines; Laboratories; Memory; Query processing; System performance; RLE; column-based compression; row-oriented database; storage;
Conference_Titel :
Cyber-Enabled Distributed Computing and Knowledge Discovery, 2009. CyberC '09. International Conference on
Conference_Location :
Zhangijajie
Print_ISBN :
978-1-4244-5218-7
Electronic_ISBN :
978-1-4244-5219-4
DOI :
10.1109/CYBERC.2009.5342213