Title :
Approximation and Compression With Sparse Orthonormal Transforms
Author :
Sezer, Osman Gokhan ; Guleryuz, Onur G. ; Altunbasak, Yucel
Author_Institution :
Mobile Processor Innovation Lab., Samsung Mobile, Richardson, TX, USA
Abstract :
We propose a new transform design method that targets the generation of compression-optimized transforms for next-generation multimedia applications. The fundamental idea behind transform compression is to exploit regularity within signals such that redundancy is minimized subject to a fidelity cost. Multimedia signals, in particular images and video, are well known to contain a diverse set of localized structures, leading to many different types of regularity and to nonstationary signal statistics. The proposed method designs sparse orthonormal transforms (SOTs) that automatically exploit regularity over different signal structures and provides an adaptation method that determines the best representation over localized regions. Unlike earlier work that is motivated by linear approximation constructs and model-based designs that are limited to specific types of signal regularity, our work uses general nonlinear approximation ideas and a data-driven setup to significantly broaden its reach. We show that our SOT designs provide a safe and principled extension of the Karhunen-Loeve transform (KLT) by reducing to the KLT on Gaussian processes and by automatically exploiting non-Gaussian statistics to significantly improve over the KLT on more general processes. We provide an algebraic optimization framework that generates optimized designs for any desired transform structure (multiresolution, block, lapped, and so on) with significantly better n-term approximation performance. For each structure, we propose a new prototype codec and test over a database of images. Simulation results show consistent increase in compression and approximation performance compared with conventional methods.
Keywords :
Gaussian processes; Karhunen-Loeve transforms; approximation theory; data compression; image representation; multimedia communication; optimisation; video codecs; video coding; Gaussian process; KLT; Karhunen-Loeve transform; SOT; algebraic optimization framework; compression-optimized transform generation; data-driven setup; image codec; multimedia signal; next-generation multimedia application; non-Gaussian statistics; nonlinear approximation; nonstationary signal statistic; sparse orthonormal transform; Algorithm design and analysis; Gaussian processes; Image coding; Linear approximation; Quantization (signal); Transforms; Sparse orthonormal transforms; image compression; linear representation; machine learning; nonlinear approximation; sparse lapped transforms; sparse multi-resolution transforms; sparse orthonormal transforms; transform optimization;
Journal_Title :
Image Processing, IEEE Transactions on
DOI :
10.1109/TIP.2015.2414879