DocumentCode
2522133
Title
Column-based RLE in row-oriented database
Author
An, Mingyuan
Author_Institution
Key Lab. of Comput. Syst. & Archit., Grad. Univ. of Chinese Acad. of Sci., Beijing, China
fYear
2009
fDate
10-11 Oct. 2009
Firstpage
309
Lastpage
315
Abstract
In database systems, disk I/O performance is usually the bottleneck of the whole query processing. Among many techniques, compression is one of the most important ones to reduce disk accesses so to improve system performance. RLE (run-length encoding) is one light-weight compression algorithm which incurs negligible CPU cost. A lot of work show that, although RLE is one of the most effective compression techniques in column-oriented systems, it is very hard to use due to bad value locality in row-oriented systems where values from multiple attributes are stored in the same page. We propose CRLE (Column-based RLE), one compression algorithm to apply RLE to row-oriented data storage. On row-oriented storage page, CRLE can exploit value locality in individual column and encode values from the same column in run-length format. Experiments show that CRLE can lead to very good compression ratio and performance in spite of row-oriented data storage.
Keywords
database management systems; query processing; column based run-length encoding; disk I-O performance; query processing; row-oriented data storage; row-oriented database systems; Compression algorithms; Computer architecture; Costs; Database systems; Encoding; Engines; Laboratories; Memory; Query processing; System performance; RLE; column-based compression; row-oriented database; storage;
fLanguage
English
Publisher
ieee
Conference_Titel
Cyber-Enabled Distributed Computing and Knowledge Discovery, 2009. CyberC '09. International Conference on
Conference_Location
Zhangijajie
Print_ISBN
978-1-4244-5218-7
Electronic_ISBN
978-1-4244-5219-4
Type
conf
DOI
10.1109/CYBERC.2009.5342213
Filename
5342213
Link To Document