Title of article
Reordering columns for smaller indexes
Author/Authors
Daniel Lemire، نويسنده , , Owen Kaser، نويسنده ,
Issue Information
روزنامه با شماره پیاپی سال 2011
Pages
21
From page
2550
To page
2570
Abstract
Column-oriented indexes—such as projection or bitmap indexes—are compressed by run-length encoding to reduce storage and increase speed. Sorting the tables improves compression. On realistic data sets, permuting the columns in the right order before sorting can reduce the number of runs by a factor of two or more. Unfortunately, determining the best column order is NP-hard. For many cases, we prove that the number of runs in table columns is minimized if we sort columns by increasing cardinality. Experimentally, sorting based on Hilbert space-filling curves is poor at minimizing the number of runs.
Keywords
Gray codes , DATA WAREHOUSING , Indexing , Compression
Journal title
Information Sciences
Serial Year
2011
Journal title
Information Sciences
Record number
1214438
Link To Document