Title :
Compressing high dimensional datasets by fractals
Author :
Wu, Xintao ; Barbará, Daniel
Author_Institution :
North Carolina Univ., Charlotte, NC, USA
Abstract :
Summary form only given. Fractal technique extensions to general datasets were proposed. The high-dimensional dataset can be viewed as a collection of cells that represent some measure on an integer n-D grid. The measure over different scales along the dimension´s hierarchies may exhibit self-similarities. A two-phase searching strategy was applied to overcome the increased searching time caused by additional dimensions. The search scheme checks a small number of spatially close local domain chunks. The data structure used is defined by 2n-tree which is a natural extension of quadtree (for image) and cotree (for volume). Each node, corresponding to a range chunk or a domain chunk, contains the summary information used for local matching. The experimental results have shown that the performance of fractal compression is comparable with rivals such as nonlinear model. The experiments over synthetic datasets have shown that the scalability of fractal compression techniques displays self-similar characteristics. To overcome high time complexity caused by additional dimensions, approximate multi-dimensional nearest neighbors searching techniques were presented that run in expected logarithmic time.
Keywords :
data compression; fractals; search problems; tree data structures; cells; cotree; dataset compression; dimension hierarchies; domain chunks; fractal compression; fractals extensions; high dimesional dataset; integer n-D grid; local matching; logarithmic time; neighbor searching scheme; quadtree; range chunk; searching time; self similarity; time complexity; two phase searching strategy; Aggregates; Cities and towns; Fractals; Image coding; Image resolution; Image sampling; Marketing and sales; Statistics; Telecommunication traffic; Traffic control;
Conference_Titel :
Data Compression Conference, 2003. Proceedings. DCC 2003
Print_ISBN :
0-7695-1896-6
DOI :
10.1109/DCC.2003.1194071