DocumentCode
2298409
Title
Nonuniform Compression in Databases with Haar Wavelet
Author
Chen, S. ; Nucci, A.
Author_Institution
Dept. of Comput. Sci., Rutgers Univ., NJ
fYear
2007
fDate
27-29 March 2007
Firstpage
223
Lastpage
232
Abstract
Data synopsis is a lossy compressed representation of data stored into databases that helps the query optimizer to speed up the query process, e.g. time to retrieve the data from the database. An efficient data synopsis must provide accurate information about the distribution of data to the query optimizer at any point in time. Due to the fact that some data will be queried more often than others, a good data synopsis should consider the use of nonuniform accuracy, e.g. provide better approximation of data that are queried the most. Although, the generation of data synopsis is a critical step to achieve a good approximation of the initial data representation, data synopsis must be updated over time when dealing with time varying data. In this paper, we introduce new Haar wavelet synopses for nonuniform accuracy and time-varying data that can be generated in linear time and space, and updated in sublinear time. The efficiency of our new data synopses is validated against other linear methods by using both synthetic and real data sets
Keywords
Haar transforms; data compression; image coding; image representation; wavelet transforms; Haar wavelet; data synopsis; lossy compressed representation; nonuniform compression; query process; time-varying data; Approximation algorithms; Approximation error; Computer science; Cost function; Data compression; Data structures; Database systems; Distributed databases; Information retrieval; Query processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Compression Conference, 2007. DCC '07
Conference_Location
Snowbird, UT
ISSN
1068-0314
Print_ISBN
0-7695-2791-4
Type
conf
DOI
10.1109/DCC.2007.59
Filename
4148761
Link To Document