DocumentCode :
2139278
Title :
Fixed-length string compression for direct operations in column-oriented databases
Author :
Ke Yan ; Meiyi Xie ; Hong Zhu
Author_Institution :
Sch. of Comput. Sci. & Technol., Huazhong Univ. of Sci. & Technol., Wuhan, China
fYear :
2013
fDate :
23-25 July 2013
Firstpage :
1171
Lastpage :
1176
Abstract :
Compression is one of the most important techniques in column-oriented database systems development. For fixed-length string typed columns, both heavyweight and lightweight compression schemes have limitations. In this paper, we propose a compression scheme, called FSC (Fixed-length String Compression), to achieve good compression ratio and support direct queries on compressed data without decompression in advance. The main idea of FSC is to vertically partition a fixed-length string typed column into sub-columns, which are compressed by different lightweight compression methods. Moreover, we present a search method, which are called FSC-search, to search on compressed data directly. Intensive experiments show that FSC not only achieves good compression ratio, but also improve query performance by supporting direct searching on compressed data.
Keywords :
data compression; query processing; search problems; string matching; column-oriented database systems development; data compression; direct operations; fixed-length string compression; fixed-length string typed column; lightweight compression scheme; search method; Arrays; Dictionaries; Educational institutions; Encoding; Pattern matching; Power capacitors; Vectors; column-oriented database; compression; decompression; fix-length string; query;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Computation (ICNC), 2013 Ninth International Conference on
Conference_Location :
Shenyang
Type :
conf
DOI :
10.1109/ICNC.2013.6818155
Filename :
6818155
Link To Document :
بازگشت