Title :
Re-encoding of perceptually quantized wavelet packet transform coefficients of audio and high quality speech
Author :
Ghahabi, Omid ; Savoji, Mohammad H.
Author_Institution :
Dept. of Electr. & Comput. Eng., Shahid Beheshti Univ., Tehran, Iran
Abstract :
This paper reports on the results of four re-encoding schemes on perceptually quantized wavelet packet transform (WPT) coefficients of audio and high quality speech. These schemes comprises: 1- embedded zero-tree wavelet (EZW) 2- The set partitioning in hierarchical trees (SPIHT) 3-JPEG-based entropy/run length Huffman and 4-JPEG-type audio Huffman coding algorithms. Since EZW and SPIHT are designed for image compression, some new modifications have been implemented in these schemes for their better matching with audio signals. The performances of these four re-encoders are compared in terms of average output bit rate and computation time of a same codec. It is concluded that the JPEG-type audio huffman coding achieves the best results although it is not possible to truncate the bit stream, in this case, to easily match the bit rate to the fixed channel capacity.
Keywords :
Huffman codes; audio coding; data compression; image coding; speech coding; wavelet transforms; JPEG-based entropy; audio Huffman coding algorithm; audio signals; embedded zero-tree wavelet; fixed channel capacity; high quality speech; image compression; quantized wavelet packet transform coefficient; reencoding schemes; run length Huffman coding algorithm; set partitioning in hierarchical trees; Bit rate; Codecs; Entropy; Huffman coding; Image coding; Partitioning algorithms; Signal design; Speech; Wavelet packets; Wavelet transforms; EZW; JPEG; Perceptually Audio Compression; SPIHT; Wavelet Packet Transform (WPT);
Conference_Titel :
Digital Signal Processing, 2009 16th International Conference on
Conference_Location :
Santorini-Hellas
Print_ISBN :
978-1-4244-3297-4
Electronic_ISBN :
978-1-4244-3298-1
DOI :
10.1109/ICDSP.2009.5201066