Title :
Fixed-length string compression for direct operations in column-oriented databases
Author :
Ke Yan ; Meiyi Xie ; Hong Zhu
Author_Institution :
Sch. of Comput. Sci. & Technol., Huazhong Univ. of Sci. & Technol., Wuhan, China
Abstract :
Compression is one of the most important techniques in column-oriented database systems development. For fixed-length string typed columns, both heavyweight and lightweight compression schemes have limitations. In this paper, we propose a compression scheme, called FSC (Fixed-length String Compression), to achieve good compression ratio and support direct queries on compressed data without decompression in advance. The main idea of FSC is to vertically partition a fixed-length string typed column into sub-columns, which are compressed by different lightweight compression methods. Moreover, we present a search method, which are called FSC-search, to search on compressed data directly. Intensive experiments show that FSC not only achieves good compression ratio, but also improve query performance by supporting direct searching on compressed data.
Keywords :
data compression; query processing; search problems; string matching; column-oriented database systems development; data compression; direct operations; fixed-length string compression; fixed-length string typed column; lightweight compression scheme; search method; Arrays; Dictionaries; Educational institutions; Encoding; Pattern matching; Power capacitors; Vectors; column-oriented database; compression; decompression; fix-length string; query;
Conference_Titel :
Natural Computation (ICNC), 2013 Ninth International Conference on
Conference_Location :
Shenyang
DOI :
10.1109/ICNC.2013.6818155