• Title of article

    Reordering columns for smaller indexes

  • Author/Authors

    Daniel Lemire (author), Owen Kaser (author)

  • Issue Information
    Journal with serial number, year 2011
  • Pages
    21
  • From page
    2550
  • To page
    2570
  • Abstract
    Column-oriented indexes—such as projection or bitmap indexes—are compressed by run-length encoding to reduce storage and increase speed. Sorting the tables improves compression. On realistic data sets, permuting the columns in the right order before sorting can reduce the number of runs by a factor of two or more. Unfortunately, determining the best column order is NP-hard. For many cases, we prove that the number of runs in table columns is minimized if we sort columns by increasing cardinality. Experimentally, sorting based on Hilbert space-filling curves is poor at minimizing the number of runs.
  • Keywords
    Gray codes, Data warehousing, Indexing, Compression
  • Journal title
    Information Sciences
  • Serial Year
    2011
  • Record number

    1214438
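
The heuristic stated in the abstract (order columns by increasing cardinality, then sort the table) can be illustrated with a small sketch. This is not the authors' implementation: the function names `count_runs` and `reorder_by_cardinality` are hypothetical, the toy table is invented for illustration, and plain lexicographic row sorting is assumed rather than any Gray-code order.

```python
from typing import List, Sequence, Tuple

Row = Tuple[str, ...]


def count_runs(rows: Sequence[Row]) -> int:
    """Sum, over all columns, of the number of runs in that column.

    A run is a maximal block of consecutive rows holding the same value.
    """
    if not rows:
        return 0
    total = 0
    for c in range(len(rows[0])):
        runs = 1  # a non-empty column has at least one run
        for prev, cur in zip(rows, rows[1:]):
            if prev[c] != cur[c]:
                runs += 1
        total += runs
    return total


def reorder_by_cardinality(rows: Sequence[Row]) -> Tuple[List[Row], List[int]]:
    """Permute columns by increasing cardinality, then sort rows lexicographically.

    Returns the sorted, permuted table and the column order that was applied.
    """
    if not rows:
        return [], []
    ncols = len(rows[0])
    # Cardinality of a column = number of distinct values it contains.
    order = sorted(range(ncols), key=lambda c: len({r[c] for r in rows}))
    permuted = [tuple(r[c] for c in order) for r in rows]
    return sorted(permuted), order


# Toy table (illustrative only): column 0 has 4 distinct values, column 1 has 2.
table = [("a", "x"), ("b", "y"), ("c", "x"), ("d", "y")]

runs_original = count_runs(sorted(table))
reordered, column_order = reorder_by_cardinality(table)
runs_reordered = count_runs(reordered)
```

On this toy table, sorting with the original column order leaves 8 runs, while putting the low-cardinality column first leaves 6. The abstract claims only that increasing cardinality is optimal "for many cases", so the sketch should be read as a heuristic, not a guarantee.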